Gene Haur_4340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4340 
Symbol 
ID5736200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5547890 
End bp5549014 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content51% 
IMG OID641281501 
Productsolute-binding protein 
Protein accessionYP_001547100 
Protein GI159900853 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID[TIGR02634] D-xylose ABC transporter, substrate-binding protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGTGC GGCGTAGCGC ATTCACTTCC CTCTTGTTGA TCCTCATCTC TAGCGTGATT 
GCAGCCTGTG GTAGCGAAAG CGCTACAACT GCTCCAACCA CAGGCACTGG TGGCACAACC
AGCAGTGGCG GCAAAGTAGT CGCTTTGTTC TTGCCCGATG CCAAAACTGC CCGCTATGAA
ACTGCCGACC GCCCCTACTT CGAAGCCAAA ATGAAAGAGC TTTGCCCAGA TTGTCAGGTG
ATTTACAACA ACGCCAACCA AGATGCCAGC TTGCAATTGC AACAAGCTGA AGCAGCGTTG
ACCAACGGCG CAAAGGTTTT GGTGCTTGAC CCAGTTGATT CTGCTGCTGC TGCTTCGATT
GCCGACAAAG CCAAAGCCCA AAATGTGCCA GTCATCGCCT ACGACCGCTT GATCCTTAAC
TCGGATGGCG TGAGCTACTA CATTTCATTC GACAACGAAT CAGTTGGCAA GTTGCAAGCC
GAAAGCTTGG TTGCGCAATT AGACAAGCAA GGGATTGCCA ACCCAACTAT CGTCATGATC
AACGGCTCAC CAACCGACAA CAATGCTAAA TTGTTCAAAG CTGGCGCTCA CAGCGTGTTT
GATCCATTGG TTAGCGCTGG CAAATTGACC ATCGCCAACG AATATGACAC TCCCGACTGG
AGCCCCGACA AAGCCCAAGA CCAAATGCAA CAAGCCTTGA CCAGCATGGG TAACAAAGTT
GATGGCGTAT ATGCTGCCAA CGACGGTACT GGTGGCGGGG CGATTGCCGC TATGAAGGCT
GGTGGTCTCT CACCATTGCC TCCAGTCACA GGCCAAGATG CTGAATTGGC GGCGATTCAA
CGGATTTTGG CTGGCGATCA ATACATGACC GTGTACAAAG CGATCAAGCC ACAAGCCGAA
GCTGCTGCTG AATTGGCCTT TGCCTTGCTC GAAGGCAAAA CCAGCGACAA AGCTACCAGC
AAAGTCAACA ATGGCAAAAT TGATGTTCCT TCAATCTTGC TGACCCCAAT TGCTGTGACC
AAAGAAAACG TCAAAGACAC GATTGTCAAA GACCAATTCC ACAAAGTTGA TCAGATCTGT
GCTGGCGACT TCGCCAAAGC CTGTGCTGAT GCCGGTATTC AATAA
 
Protein sequence
MNVRRSAFTS LLLILISSVI AACGSESATT APTTGTGGTT SSGGKVVALF LPDAKTARYE 
TADRPYFEAK MKELCPDCQV IYNNANQDAS LQLQQAEAAL TNGAKVLVLD PVDSAAAASI
ADKAKAQNVP VIAYDRLILN SDGVSYYISF DNESVGKLQA ESLVAQLDKQ GIANPTIVMI
NGSPTDNNAK LFKAGAHSVF DPLVSAGKLT IANEYDTPDW SPDKAQDQMQ QALTSMGNKV
DGVYAANDGT GGGAIAAMKA GGLSPLPPVT GQDAELAAIQ RILAGDQYMT VYKAIKPQAE
AAAELAFALL EGKTSDKATS KVNNGKIDVP SILLTPIAVT KENVKDTIVK DQFHKVDQIC
AGDFAKACAD AGIQ