Gene Haur_4009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4009 
Symbol 
ID5735870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5115669 
End bp5116754 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content54% 
IMG OID641281159 
Productglycosyl transferase group 1 
Protein accessionYP_001546769 
Protein GI159900522 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATAG CTTTAGTTCA TGATTATTTG AATCAATATG GTGGCGCGGA ACGGGTGCTC 
GAAGTGTTGC ACGCCATGTT TCCGCAAGCC CCAATTTATA CATCAATTTA CGATGCCGAG
GCCATGCCCA GCCACTATCG CAGTTGGGAT ATTCGCACCT CGTTTATGCA AAAGCTGCCA
GGTTGGCGCA AGCATTTTCG TAAATATTTT TTGCTCTATC CCAGTGCCTT CGAGCATTTC
GACCTGAGCG CCTATGATTT GGTGATCAGC TCATCGAGCG CTTATGCCAA GGGCGTAATA
ACCAAACCTG GCGCTCGCCA TGTGTGCTAT TGCCACACCC CAATGCGCTT TGCTTGGCGC
ACCGACGATT ACGTTAAACG CGAGCAGATC AGTGGGATCT TCGGGGCGAT TCTGCCCTTC
TTTTTGACCT ACCTGCGCAT GTGGGATGTC CAATCGTCAG GCCGCGTAGA TCGCTTTATT
GCCAACTCGC GCACGGTTGC TGATCGGATT GACCATTTCT ACAAACGCCC TTCAACAATC
ATCACGCCGC CAGTTGAATT GCAGCCATTC GAGCCACAAC CAGCCGAAGA TTTTTATTTG
GCGGGCGGGC GGCTTGTGCC CTACAAACGG CTTGATTTGG CGATCAAAGC GTGTACCAAA
CTTGGTTTGC CCTTGGTGAT TTTTGGCGAT GGCCGCGATC GCGCCGAGCT TGAAAAAGTG
GCAGGGCCAA GCGTGCGCTT CGTTGGCAAA GTTGATGACG CGACCTTGCG CAGTTTATAT
GCCCGTTGTC GCGCCTACCT CATGCCAGGC GAAGAAGATG CAGGCATTCA GCCGCTCGAA
GCCATGGGTG CAGGCCGCCC TGTGATCGCC TACCAAGCAG GTGGGGCACT CGATAGCGTG
ATCGAAGGCC AAACTGGGCG CTTTTTCAGC CAACAAACCG TCGAAGATCT GGCGGCGGCC
ATCCTTGCCA GCCAAAACGA TCACTACGAG CCAACGGCGA TTCGCGCTCA TGCCGAGCAA
TTTGCCCGCC CCGCCTTCGA GGCGCGGATT CGGGCCGAAG TCGAAGCGGT GTTAAACGAA
GGATGA
 
Protein sequence
MQIALVHDYL NQYGGAERVL EVLHAMFPQA PIYTSIYDAE AMPSHYRSWD IRTSFMQKLP 
GWRKHFRKYF LLYPSAFEHF DLSAYDLVIS SSSAYAKGVI TKPGARHVCY CHTPMRFAWR
TDDYVKREQI SGIFGAILPF FLTYLRMWDV QSSGRVDRFI ANSRTVADRI DHFYKRPSTI
ITPPVELQPF EPQPAEDFYL AGGRLVPYKR LDLAIKACTK LGLPLVIFGD GRDRAELEKV
AGPSVRFVGK VDDATLRSLY ARCRAYLMPG EEDAGIQPLE AMGAGRPVIA YQAGGALDSV
IEGQTGRFFS QQTVEDLAAA ILASQNDHYE PTAIRAHAEQ FARPAFEARI RAEVEAVLNE
G