Gene Haur_4083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4083 
Symbol 
ID5735942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5216028 
End bp5217227 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content51% 
IMG OID641281235 
Productglycosyl transferase group 1 
Protein accessionYP_001546843 
Protein GI159900596 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGTCT TGATGTTGGC TTGGGAATAT CCACCGCATA TTGTTGGCGG GATGGGCAAG 
CATATTGCTG AGTTGGTTCC AGTGCTTGAT GCGGCTGGAA TCGAAGTGCA TGTGCTTACG
CCATGGTTGC GTGGCGGCCC CCAACATGAG CGTTTTGGCA TGCACAGCCA TATTTGGCGG
GTTCAGCCAC CAGCCATGCC CGATTATGGC TTTGTTTCGT TTACCCAAGA AACCAACCGC
TATCTTGAGC GCTTCGCCCA CGATTTGGGC AAAACCCATG GCCCATTTGA TCTGATTCAT
GGCCATGATT GGCTGACCAG CTATTGTAGC GTCGCTTTGA AATATGCTTG GCATACTCCC
TTGATTACAA CGATTCATGC AACTGAACGC GGGCGTGGAC GTGGCTCGCT GGGCGGCGAT
CATGCCAAAA CGATTAATGG CTTGGAATGG TGGTTGGCCC ACGAAAGTTG GCGGGTAATT
GTGTGCAGCG ATTTTATGGC CGACCAGTTG CATCAATTTT TTGGCACGCC CTTTGATAAA
CTCGATGTGA TTGCCAACGG CGTGAATGTG CCAACGATTG AATGGCCGAG CCAAGAGCGC
CAGCAATTTC GCCAAAAATA TGCCGCTGAT AACGAAAAAG TAGTGTTTAG CATTGCCCGC
ATGGTCTACG AAAAAGGCAT TCAAGTGTTG GTTGAGGCAA TTCCACATGT CTTGGCGCAA
CGCCGCGATA TCAAATTTGT GATTGCGGGC ATGGGGCCGT TAGCCGAACA ATTGCGCAAC
CGCAGCCGTG AGCTTGGCAT CGATGCACAT GTTTATTGGA CGGGCTTCGT GACCGATCAA
GATCGCAATT ATCTCTACAA TGTTGCTGAT GTGGCAGTAT TTCCCAGCAT CTACGAGCCA
TTTGGGATTG TAGCCTTGGA AGCGATGGCG GCACATTGCC CGGTCATTGT TTCGGATACT
GGTGGTTTGC GTGAGGTGGT GCAAATTCAC GAAACAGGCT TAACGGTTTA CCCTGATAAC
CCTGAATCGT TGGCTTGGGG CATTTTGCAT ACGCTGTCCC ACCCCGAATG GACCCAGCAA
CGAGTTGAAA ATGCCTTCAA AACGGTGGTT GAGATTTATA ATTGGCCATT AATTGCCTGC
CAGACCCAAG CCGTCTATCA ACGGGTTTGC GACGAGCGTG CAACTAGTAT GTGGGGATGA
 
Protein sequence
MRVLMLAWEY PPHIVGGMGK HIAELVPVLD AAGIEVHVLT PWLRGGPQHE RFGMHSHIWR 
VQPPAMPDYG FVSFTQETNR YLERFAHDLG KTHGPFDLIH GHDWLTSYCS VALKYAWHTP
LITTIHATER GRGRGSLGGD HAKTINGLEW WLAHESWRVI VCSDFMADQL HQFFGTPFDK
LDVIANGVNV PTIEWPSQER QQFRQKYAAD NEKVVFSIAR MVYEKGIQVL VEAIPHVLAQ
RRDIKFVIAG MGPLAEQLRN RSRELGIDAH VYWTGFVTDQ DRNYLYNVAD VAVFPSIYEP
FGIVALEAMA AHCPVIVSDT GGLREVVQIH ETGLTVYPDN PESLAWGILH TLSHPEWTQQ
RVENAFKTVV EIYNWPLIAC QTQAVYQRVC DERATSMWG