Gene Haur_3300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3300 
Symbol 
ID5735170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4165051 
End bp4166178 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content50% 
IMG OID641280447 
Productglycosyl transferase group 1 
Protein accessionYP_001546064 
Protein GI159899817 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00341584 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGTGTGG CTCCCAAAAT TGCTTTTATT CGTAAGGGGC GCTGGCCGTT GGCGAATGTG 
CGAACCGCCG AAGCACTGCG TGCTCAATTC CCCGAATACG AACTCCGTGA GATCGATTTA
ATTCCCATCA TTCGACGCAA GCCTGCCTTG GTTGCATTAA ACGGCTGGTG GACATTACGC
CAATATGCTG GCGATTTGGC CATGCGGCGA CGTGGCCCCA AAGATGCCTT TTTGATCACT
AGTTATATTT TCCGTGCTGT CAAGCATTTA GTGGCCGATT TACTGCGCGA TGACGACTAT
CTGTTTAGTT TTCAGATGCA ATCATTATTT GATGCTAGCG TGCCAAATAT CCCGCATTTT
GTCTATACCG ACCATACCTT GCTGGCAAAT CGTCAATATC CAGGCTTTAA TCCGGCTTCA
CTCTATCACC CTGAATGGAT GAAGTTAGAG CCAACAATTT ATCAAAATGC CAATTTAGTC
TTTACACGTT CCAATCATGT TTCACGTTCA CTAGTCGAAG ATTACCATTG TGATCCAGCC
AAAGTACGTT GTGTTTACGC TGGCAGCAAT GCTCCGGTGA TCAGCGAACC GCCTGATCCA
GCTCGCTATG CCAGCCAAAA TATTGTCTAT GTTGGGATAG ATTGGGAGCG TAAAGGCGGC
CCTGAATTGC TGCAAGCTTT CGCCCAAGTG CGGGCGGTTT ATCCCAATGC CACCTTGACG
ATCATCGGCG CAAACCCTCA AACCAATCAG CCAGGGGTTG AGGTGATTGG GCGGATTCCG
GTTGAGCAAT TACCGCACTA TTATCAACGC GCCGCCGTCT TTTGCATGCC CACCAAACTT
GAGCCATTTG GCATCGTCAC GATTGAAGCC ATGAACTATT GGCTGCCTGT GGTTTCAACC
AATCTTGGAG CCATGCCCGA TTTTATCGAG CACGATCACA ACGGCTATTT GGTCGAACCA
GGCACGGTTG ATCAACTAGC CACCGCCTTG ATCAAGCTGG TTGGCGATCC TGAACGCTGT
CGGCGTTTTG GGGCACGCAG TGTCGAAATT GCGGCACGGT ATCGCTGGGA ATCGGTTGGT
TCGGCTATGC GTGAGGCAAT TATCCAGAAC ATAGAGCATA GAACATAA
 
Protein sequence
MRVAPKIAFI RKGRWPLANV RTAEALRAQF PEYELREIDL IPIIRRKPAL VALNGWWTLR 
QYAGDLAMRR RGPKDAFLIT SYIFRAVKHL VADLLRDDDY LFSFQMQSLF DASVPNIPHF
VYTDHTLLAN RQYPGFNPAS LYHPEWMKLE PTIYQNANLV FTRSNHVSRS LVEDYHCDPA
KVRCVYAGSN APVISEPPDP ARYASQNIVY VGIDWERKGG PELLQAFAQV RAVYPNATLT
IIGANPQTNQ PGVEVIGRIP VEQLPHYYQR AAVFCMPTKL EPFGIVTIEA MNYWLPVVST
NLGAMPDFIE HDHNGYLVEP GTVDQLATAL IKLVGDPERC RRFGARSVEI AARYRWESVG
SAMREAIIQN IEHRT