Gene Haur_4463 details


Gene Information

Locus tag: Haur_4463
Symbol:
ID: 5736314
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5707502
End bp: 5708737
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 52%
IMG OID: 641281626
Product: extracellular solute-binding protein
Protein accession: YP_001547223
Protein GI: 159900976
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:
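
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5707502
end_bp = 5708737

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1236
print(protein_length)  # 411
```

Under these assumptions the result agrees with the listed gene length (1236 bp) and protein length (411 aa).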


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00452134
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGCAG TATGGCGGTA TTTTAGCCTT TTTATGATTG GGATTCTCAT TGTTGCTGGT 
TGTAGCACGA CAAGCTATGA TAACACTTTG TTAACCCCAG GGGCTACCGC TAATCCTCCC
GCACGCACGC AACTGGTCAT TTGGCATGGA GCCGATGCCC AACGGAGTGA ACTTTTCACC
CGATTGCTGT TAGAATATCA ACGTGAGCAT CCAAACGTTG TCATTCAAAT CGTCAATCGT
GGGGCAAACC TGCTACACGA TTATCGTGCG GCCCTGCTTG AAGGCACACC ACCAGACCTG
ATCTGGCTCA ACGAAAATCG TTGGGTAGGG GAACTGGCAG ATCAACAGCT TATCATCGAT
CTGACAGAAC GACTAAGCGA CGAGAACCTT GAGTCAATTG CGCCAGCTGC GCTTGATGGT
GCCCGCTATG GCGAGAAATT ATATGGCTTG CCATTGACGC TAGATCTACC TGTGCTGTAC
TATAATCGTG CAAACTTTGT GAGCACACCG CCGCAAAGCA CTGCTGAATG GCTTGAGATC
GCTCGCGGGT TTAGCGATGA TCAAGGACAG TACGGATTAG CGTATAATTT ATCGCTATAC
TTTACCCAAC CCTACCTCCC AGCCTTCGGA GGCGCAATCT TCGATACTAC TGGCGAGGTC
GTGCTGGGAA CCCAAAGCTA TACCCCAACA TTACAGTGGT TGACGTGGGT TGACGAATTA
GCCCAAGACC CACGCTTGTT AGCCCGTGAT GATCATCGAC TGGTAGCTCG CAGCGTGAGC
CAAAACAGTG CGATCATGAC GATTGACTGG GCAGATCAAA TCGGAACCTA TCGTCAATTG
TGGGGGGAGA ACGTTGGCGT GCAACCATTG CCACGCCTGA GCCAAACCGG CCAAGAACCG
CAACCCTTTG TTCGGAGCAG CGTGCTTGTG ATCAGCCCAC GCAGCACTGA ACAACAGCAA
AACGCCGCGC TTGACGTGAT GCGGTTTCTT GTGGAGATGA AAGCGCAAAC GGCTTTTCAA
GCCGCTGATA TACCAAGCGT CCGAATCGAC TTAGCCAGTG TCGATCCACT TTACACTCAA
ATTCAACTGG CGGTTAGTCG AGCCAGCGCT TGGCCCACCA CCCTTCGTTT CAACAACGGA
TGGGATATCC TGATCGCTTT GGTGCGTAAT AGCTTAAACG GCGCACCCTT GGAAGAAAGT
ATCGCGAATG CCGATCGCCT GCTACGGAGC GAGTAG
 
Protein sequence
MRAVWRYFSL FMIGILIVAG CSTTSYDNTL LTPGATANPP ARTQLVIWHG ADAQRSELFT 
RLLLEYQREH PNVVIQIVNR GANLLHDYRA ALLEGTPPDL IWLNENRWVG ELADQQLIID
LTERLSDENL ESIAPAALDG ARYGEKLYGL PLTLDLPVLY YNRANFVSTP PQSTAEWLEI
ARGFSDDQGQ YGLAYNLSLY FTQPYLPAFG GAIFDTTGEV VLGTQSYTPT LQWLTWVDEL
AQDPRLLARD DHRLVARSVS QNSAIMTIDW ADQIGTYRQL WGENVGVQPL PRLSQTGQEP
QPFVRSSVLV ISPRSTEQQQ NAALDVMRFL VEMKAQTAFQ AADIPSVRID LASVDPLYTQ
IQLAVSRASA WPTTLRFNNG WDILIALVRN SLNGAPLEES IANADRLLRS E
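
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch, not the method used to generate the record. It assumes the sequences are pasted in with their 10-character spacing, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGAGGGCAG TATGGCGGTA TTTTAGCCTT"
print(translate(first_bases))  # MRAVWRYFSL
```

Passing the complete gene sequence to `translate` and `gc_content` would allow a comparison with the protein sequence and the GC content field in the record.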