Gene Haur_5094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5094 
Symbol 
ID5737052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp121325 
End bp122473 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content51% 
IMG OID641282259 
Productglycosyl transferase group 1 
Protein accessionYP_001547850 
Protein GI159901604 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAATTG CGCTTGTGTT ATCAACACCC TTACCGGCCT GTGAAGGTAT TGGGTTTTAT 
GTTTGGAATC TTGGACGCTT TTTGACCCAC CATGGGCATG AAGTGCATAT CATTACGCGT
GGTGAACCAA CGAAACCGGC CTATGAACAG GTGCAGGCGA TCCATATCTG GCGGCCAGCA
TTTTGGCGAA TCTATCCCTT CCATGTCGAT ATGCATGGCT ATTTTGTGAC CCAAACCCTC
GAAACGATTG CGAATACCTA TGGACTTGAT TTAATTCATG TTCATACCCC CCTCGTTAAA
ATTCCGAAGA GTGCCTATCC GGTCGTCGTT ACTGTGCATA CGCCAATGAA GACCGACACT
GCCGCCATTC CATTGCGATC GGTGTTTGAT ATGCTGATTA AGCTGCAAAC GCCATTTAGC
ATTCGTTTGG AAAAACGGCT TTTCCGACAA GCCACAACCA TCACAACTGT GGCGACGAGT
GTTGCCTCTG AATTAGGGGC CTATGGGTTG CAACCACATC AGGTAGCCGT AGTTGGGAAT
GGTGTCGATA CGGCGACCTT CTATCCGCCC GTTGATCTGC AAGCACGATT TCACCAGCGC
TACTTTTTAA CCGTTGGGCG ACTCGCACCG CGAAAAGGAT TAGAAGATTT AATTGCGAGT
GCAGCAGAAG TCGTTAAACG CTATCCTACC TATCGCTTTT TCATTGTCGG CCAAGGGCCG
CTCGCCGCAG TCCTACAAAA ACAAATTACC CAGCTTCACC TTGATCAGCA TGTGCAATTA
CTCGGTCATA TGGCGGATCG AGAACAGCTT GCCGATCTGT ATCGTGGGGC ATGGGCCTAT
ATCCACCCTG CCCATTATGA AGGGTTACCG ACGGCGTTGT TAGAGGCGAT GGCATGTGGC
TGTCCCGTGG TGGCAACGGC GGTGAGTGGT GCCCTTGATG TGATTACACC GCACAATGGG
GTATTGGTGA ACCCTCATGC CCCAGTGCAA TTAACACAGG CAGTATGTCG CTTCATTGAA
CAGCCACAGG TCGCACGGGA TCTCGGCCAG CAAGCAGCCT TGACGATCCA ACAGCAGTAT
GGGTGGACTG CGATAGGCCA ACGCTATCTT GCGACCTATC ACCATGCTAT CCAAGGAGCA
ACTGCATGA
 
Protein sequence
MRIALVLSTP LPACEGIGFY VWNLGRFLTH HGHEVHIITR GEPTKPAYEQ VQAIHIWRPA 
FWRIYPFHVD MHGYFVTQTL ETIANTYGLD LIHVHTPLVK IPKSAYPVVV TVHTPMKTDT
AAIPLRSVFD MLIKLQTPFS IRLEKRLFRQ ATTITTVATS VASELGAYGL QPHQVAVVGN
GVDTATFYPP VDLQARFHQR YFLTVGRLAP RKGLEDLIAS AAEVVKRYPT YRFFIVGQGP
LAAVLQKQIT QLHLDQHVQL LGHMADREQL ADLYRGAWAY IHPAHYEGLP TALLEAMACG
CPVVATAVSG ALDVITPHNG VLVNPHAPVQ LTQAVCRFIE QPQVARDLGQ QAALTIQQQY
GWTAIGQRYL ATYHHAIQGA TA