Gene Haur_3659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3659 
Symbol 
ID5735520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4599898 
End bp4601037 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content48% 
IMG OID641280808 
Productmultiple sugar-binding periplasmic receptor ChvE precursor 
Protein accessionYP_001546423 
Protein GI159900176 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATGA AGTCTCCTCG CGCGAAGTTC ATGATGGTTT TGATGTTGCT CATGACAATG 
GTTTTGGCCA GCTGTGGCGA AGCCGCAACT CCAACCACTG CGCCAACCGA TGGCGGCACT
GGTGGCACAA CCGCCAGCAG CGATAACAGC AAATTCACCA TTGGGATCTC AATGCCCACC
AAATCATCAG CCCGCTGGAT TGCCGACGGC GATAATATGG TGAAATATTT CCAAGAAAAA
GGCTTCAAAA CCGACCTTCA ATACGCTGAA GACGATATCC CAACCCAACT TTCACAAATT
GAAAACATGG TCACCAAAGG GGTCAATGTT TTGGTGATTG CGGCAATCGA TGGCGAAACC
CTCTCAGATG TTCTGCAAAG TGCTAAAGAC AAGAAAATTC TGGTTATCGC CTACGACCGC
TTGATCAAGA AAACCCCCAA CGTTGACTAC TACGCCACCT TCGATAACTT CCAAGTTGGG
GTCTTGCAAG CCCAATCAAT CGAAACCAAA CTTGGTTTGA AAGAAGGCAA AGGCCCATTC
AATATCGAGT TGTTCGGTGG CTCATCCGAC GATAACAACG CCTTCTTCTT CTACAATGGC
GCTATGTCGG TCTTGCAACC TTACATCGAT AGCGGCAAGT TGGTGGTTGG TAGCGGCCAA
ACTGGTATGG ATAAAGTTGC CACCCTGCGC TGGGACGGTG CTACCGCCCA ATCACGCATG
GACAACATTT TGAGCGCCTT CTACGGCGAC AAACGGGTTG ATGCAGTGCT TTCACCATAC
GATGGTATCA GCATCGGGAT CATCTCATCG CTCAAGGGTG TTGGCTACGG TAGCGCCGAC
AAACCAATGC CAGTCGTTTC AGGCCAAGAT GCTGAAGTGC CCTCAGTCAA GTCAATTATC
GCTGGCGAAC AAAGCTCAAC CATCTTCAAA GATACCCGCG AGCTTGCTAA ATCAGTGGTA
GGTATGGTCG AAGCATCATT GTCAGGCAAA GAAGTTGCCG TCAACGATAC CAAAACCTAT
GACAATGGGG TCAAAGTTGT TCCTTCACAA TTGCTGGTTC CGGTGGTCGT CGATGTAACC
AACTGGGAAA AAGTATTGAT CGACAGTGGT TACTACAAAA AAGAAGACAT AACCAAATAA
 
Protein sequence
MTMKSPRAKF MMVLMLLMTM VLASCGEAAT PTTAPTDGGT GGTTASSDNS KFTIGISMPT 
KSSARWIADG DNMVKYFQEK GFKTDLQYAE DDIPTQLSQI ENMVTKGVNV LVIAAIDGET
LSDVLQSAKD KKILVIAYDR LIKKTPNVDY YATFDNFQVG VLQAQSIETK LGLKEGKGPF
NIELFGGSSD DNNAFFFYNG AMSVLQPYID SGKLVVGSGQ TGMDKVATLR WDGATAQSRM
DNILSAFYGD KRVDAVLSPY DGISIGIISS LKGVGYGSAD KPMPVVSGQD AEVPSVKSII
AGEQSSTIFK DTRELAKSVV GMVEASLSGK EVAVNDTKTY DNGVKVVPSQ LLVPVVVDVT
NWEKVLIDSG YYKKEDITK