Gene Haur_2247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2247 
Symbol 
ID5734134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2864289 
End bp2865896 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content50% 
IMG OID641279388 
Productextracellular solute-binding protein 
Protein accessionYP_001545015 
Protein GI159898768 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0125251 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAAC GTTTATCGAT TTTAGGTCTC TTGCTACTGG TTTTAGTTGG CTGCGATCTT 
GGGAGCAACC AAGCTACCCC AGCACCTGCA TCAACTCCAG TAGCCCAAAG CCAAGCTGGT
GGCGGCGGCA ATTTTATTTT GACGCTTGGC GATGATCCAG CGACGATCGA CCCAGCATTG
GTTGGCGATA CGACCAGCGG TTTTATTGCC CGCTTGATGT TTAGCGGTTT GGTGACGCTC
AACAACGAGC TTGAAGCAGT CCCCGATTTA GCTGAAACGA TCGATGTTTC GGCTGATGGC
ACGGTATATA CCTTTAAACT GCGCTCGAAT GCACGCTTTG CCGATGGTAC GCCGATTACT
GCTGAAGATG TGCGTTGGAG TCTCGAACGG GCCACCGACC CTAGCCTTGG CTCAATTGTT
TCGCCAACCT ACCTCGATGA TGTTGCTGGA GTGCTTGAAA AAGTGACTGG GCAAGCCAAC
TCGCTCAGTG GGGTCAAGGT GGTTGATGAT CAAACAATCG CGATTACGCT GCGCCAGCCA
AGCTCGCTAT TTTTGCTCAA ATTGACTCAC CCGCCAGCCT TTGTGCTCGA TCGGCGCACA
GTTGAGGACA ATAGCGATTG GCTCGAAAAA CCCAATGGCT CAGGCCCGTT TATGCTCGAT
CTATGGAATC ATCGCCGCCG CATGGAGCTT GTGCCCAACC CATACTACTA TGGCACTGCC
CCCAAATTGG ATCGGATTAC CTATCTGATT GGGGCGGAAG GTAGTAATCC GCTGGGGTTG
TATGAGCAGG GCGAAATCGA CGTGACGGGC ATTGGCAGCT ATGATCTTGA TCGGATTAAT
GATGAGGCTG ATCCGTTGCA CGCTGAGCTG CGGATCACGC CACAGTTGCA ATTAAGCTAT
ATTGGCTTGA ATGTGAATCA ACCGCCATTT GACGATCCGA AGGTGCGCGA AGCCTTTTAT
TTGTTGATCG ATCGGGTTAA ATTGGCCGAT GTTTCGTACA ATGGCTCGGT GGTCGCAGCC
CGGGGGATTT TGCCACCTGG CATGCCTGGA GCCGAGCCAG AACGTTTGCC TGAGCCAAAC
GCCGATATTG CCCGTGCCAA ACAACTAATC AGCGAATCGA GCTATGGTAG CGTTGAAAAG
TTTCCGCCGA TTATTGGCTA TAGCAGCGGT AGCGGGGTAG GTTTGCTGGC CCAAATTGCC
AAAGATGAGC TTGGGGTAAC GATTGAAATT CGTGGTCAAG ATCAATTTGG CGATTATTTG
GCAGCACTTG AGCGCGATAA TTATCATTTG TATGACCTCA GTTGGATCGC CGACTATCCC
GATCCACAAA ACTTTTTAGA GGTGTTGTTT GGCAGCAAGG GTCAATACAA TCGCACCAAT
TACAGCAACG CGAAGTTTGA TCAACTGATT GAGCAAGCCA AAGCCGAGGC CGATGCTGAA
AAACGTGGGG CACTCTATCG CCAAGCCGAA GAGCAATTGC TCAGTGATTT TGTGGTCATT
CCTTTGGTGC ATACTGTTGA TTATTCCTTG GTAAAATCGT ATGTTGATGG TTACATGATT
ACGGCTTTGG GTGAGCTAGA TTTAACTGGA GTTTCGCTCA AACGCTAG
 
Protein sequence
MDKRLSILGL LLLVLVGCDL GSNQATPAPA STPVAQSQAG GGGNFILTLG DDPATIDPAL 
VGDTTSGFIA RLMFSGLVTL NNELEAVPDL AETIDVSADG TVYTFKLRSN ARFADGTPIT
AEDVRWSLER ATDPSLGSIV SPTYLDDVAG VLEKVTGQAN SLSGVKVVDD QTIAITLRQP
SSLFLLKLTH PPAFVLDRRT VEDNSDWLEK PNGSGPFMLD LWNHRRRMEL VPNPYYYGTA
PKLDRITYLI GAEGSNPLGL YEQGEIDVTG IGSYDLDRIN DEADPLHAEL RITPQLQLSY
IGLNVNQPPF DDPKVREAFY LLIDRVKLAD VSYNGSVVAA RGILPPGMPG AEPERLPEPN
ADIARAKQLI SESSYGSVEK FPPIIGYSSG SGVGLLAQIA KDELGVTIEI RGQDQFGDYL
AALERDNYHL YDLSWIADYP DPQNFLEVLF GSKGQYNRTN YSNAKFDQLI EQAKAEADAE
KRGALYRQAE EQLLSDFVVI PLVHTVDYSL VKSYVDGYMI TALGELDLTG VSLKR