Gene Haur_0284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0284 
Symbol 
ID5732179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp331065 
End bp333407 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content52% 
IMG OID641277408 
Productglycosyl transferase group 1 
Protein accessionYP_001543064 
Protein GI159896817 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGACAC CGATTCGCTC AATCTTAATC ATTGGAAATT ATGCTCCGCG CCAGTGTGGC 
ATAGCAACCT ATACCACCGA TTTACGCATG GCCTTGCTTG ACGCGTATCC GCAGAGCAGG
ATCGACGTGA TGGCCATGAA TGATACTCCC GCAGGCTATG ACTACCCCGA TTCAGTCGTC
TTTACGATTG ATCAGGATGA CCCCTATGCC TATTATCAAG CTGCCGATTT TATCCGCTTG
AGTTCGTATG ACCTTGTGTG TATTCAGCAC GAATATGGGA TTTTTGGTGG TGCTTCGGGG
CGGAATCTGC TCTTGCTCAT TCGTGCTATC ACGATTCCGA TTGTAACCAC CTTACACACG
GTGTTACGCG AACCAACCGC CGATCAACAT ACGATTCTGT GGGAGCTTGC GCAACGATCA
CAACGGATTA TCGTGATGAG TTCGCATGCG GTCGATTTAA TGCATACAAT CTATGGTATT
CCATTGGCTC AGATTGATTG TATTCCGCAT GGCATTCCTG ATCTGCCGTT TCGCGATGGC
TATGAGGATA AACAACACGC GAATCTGATT GGGAAGCAGG TTTTGCTTAC CTTCGGCTTG
CTTTCGCCAA ATAAAGGGAT TGAAGATGTC TTACATGCGT TGCCATTGCT GATCAAACAG
CATCCGCATG TGCTGTATCT GATTGTTGGC GCAACCCATC CAACCGTTCG TCAAACCTTT
GGCGAGGCCT ATCGGGAGAT GCTTCAGGCC TTAGTCGAGC AACTTGGAAT TCAAGCGCAT
GTGCGGTTTC ACGACCAATT TGTGAGTTCC AGCGCTTTAG CAATTTATAT GGGTGCGGCT
GATATTTATA TTACGCCCTA TCATACCCAA GAACAAAGCG TCTCGGGTAC TTTAGCCTAT
GCGATTGGGG CAGGCAAGGC GATTGTCTCG ACACCCTATT GGTATGCAAC CGAACTCTTG
GCCCATGGTG GTGGCATGCT GGTTCCGTTT CATGATCCAG CGCTGCTCGC TGAGCAGGTT
AACACGCTTT TAGCCGAACC CCAGCTGCGT CAAACCATCC GCGAACGCGC CTATCAACGC
GGGCGCACGA TGCTCTGGTC GGTTGTTGCC ACGCACTATA TGCACAGTTT TATGCAGGCA
CGAACGCACC CCCTCCATCC CGTGCTAGTG CTGGCGAATC AACCAATCAA GTCCGATTCG
CCGATCATCC CGCCGTTATG TCTTGATCAT CTGATCGCCA TGACTGACGA TATGGGCTTG
ATCCAACATG CGATCTTGAA TATTCCGAAT CATCACGAAG GCTATGCGAC CGATGATAAT
GCGCGAGCCT TGATTGCCAC GATGCTGCTC GATCCTATCC AAGAACCACA GGCCCAGCGC
TTGGCAATGC GCTATTTAGC GTTTCTCTGG TATGCCTTTA ATCCAGCGAC CCAACGGTTT
CGCAACTTTA TGGGGGCAAA TCGGCAGTGG TTAGAGGCGA CAGGCTCGGA AGATGCCCAT
GCTCGCAGTA TCTGGGCACT TGGCACGGTG CTTGAACAGA GCCATGATCC GGGCTTGTGC
GGCGTTGCGC AACGACTCCT CCGTTTTGCG CTTCCAGCGG TGGCTCAGCT GACTCACCCA
CGGCCATGGG CCTTAGCATT GTTGGGTTTC GCTGCCTATC GTCAACGCTT TCCTGGTGAT
CGCACGGTTA TGGCCAGCCA ATTGCACCTC GCTGAACAAT TATTGTCCCG CTTTCAAGCG
GCGCATCAGC CTGATTGGGA ATGGTTTGAT GATCATCTGA CCTATGATAA TGCGGTTCTG
CCGCATGCAT TGATTGTCAG CGGTCAAACC CTTCAGCGCC CAGATATGGT TGAGGCTGGG
TTAACGGCCT TGACCTGGCT CTGTGCGATT CAGCGGCCTG AGGCTGACCA TTTCAAGCCC
ATTGGCTCGA ATGGGTTTTT CCAGCGTGGG CAAGCTCCAG CCCACTATGA TCAACAGCCA
ATTGAGGCTC AGGCAACCGT GCTGGCAGCC TGTGCCGCCT TTGAAAGCAC AGGCGATTCA
GGCTGGTACG ATGAAGCCCA GCATGCCTTT TATTGGTTTC TCGGCCACAA CGATGCAGGC
GTAGCCCTTT ACGACCCGCG CACTGGTGGC TGTGCTGATG GGCTGGAAAT TGATCGGATT
AATCAAAATC AGGGGGCTGA ATCAACCCTC GCATTTTTAA TTGGCCGATT GACGATGCAG
ACGATCACGC CACCAATCAG GCGCGGAACA GCGGAAAACA CGTCAGCCAG CGTCACCCGA
CCACCGCGCA TTCGAGCGGA TTTGATTCCG CCACGTGATC CCAATCCATC AAGACCATTG
TAG
 
Protein sequence
MPTPIRSILI IGNYAPRQCG IATYTTDLRM ALLDAYPQSR IDVMAMNDTP AGYDYPDSVV 
FTIDQDDPYA YYQAADFIRL SSYDLVCIQH EYGIFGGASG RNLLLLIRAI TIPIVTTLHT
VLREPTADQH TILWELAQRS QRIIVMSSHA VDLMHTIYGI PLAQIDCIPH GIPDLPFRDG
YEDKQHANLI GKQVLLTFGL LSPNKGIEDV LHALPLLIKQ HPHVLYLIVG ATHPTVRQTF
GEAYREMLQA LVEQLGIQAH VRFHDQFVSS SALAIYMGAA DIYITPYHTQ EQSVSGTLAY
AIGAGKAIVS TPYWYATELL AHGGGMLVPF HDPALLAEQV NTLLAEPQLR QTIRERAYQR
GRTMLWSVVA THYMHSFMQA RTHPLHPVLV LANQPIKSDS PIIPPLCLDH LIAMTDDMGL
IQHAILNIPN HHEGYATDDN ARALIATMLL DPIQEPQAQR LAMRYLAFLW YAFNPATQRF
RNFMGANRQW LEATGSEDAH ARSIWALGTV LEQSHDPGLC GVAQRLLRFA LPAVAQLTHP
RPWALALLGF AAYRQRFPGD RTVMASQLHL AEQLLSRFQA AHQPDWEWFD DHLTYDNAVL
PHALIVSGQT LQRPDMVEAG LTALTWLCAI QRPEADHFKP IGSNGFFQRG QAPAHYDQQP
IEAQATVLAA CAAFESTGDS GWYDEAQHAF YWFLGHNDAG VALYDPRTGG CADGLEIDRI
NQNQGAESTL AFLIGRLTMQ TITPPIRRGT AENTSASVTR PPRIRADLIP PRDPNPSRPL