Gene Gdia_0848 details

Gene Information

Locus tag: Gdia_0848
Symbol:
ID: 6974245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 964466
End bp: 965827
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 65%
IMG OID: 643390377
Product: extracellular solute-binding protein family 1
Protein accession: YP_002275253
Protein GI: 209543024
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
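
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above.
gene_len, protein_len = cds_lengths(964466, 965827)
print(gene_len, protein_len)  # prints: 1362 453
```

Under these assumptions the result agrees with the listed gene length (1362 bp) and protein length (453 aa).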


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0413942
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.363265
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCATT CCACGACTTC GCGGCGCTGG TCCGGTGCCT CGATGCAGAT TCTCGGCATG 
CTGGTCCTGG GCACCGCATC CTGCCTCGGC ATGGGCGCGG CGTCGCCGGC CCATGCGGCC
GGCACGCTGA CGATCGCCAC GGTCAACAAT GCCGACATGG TCGTCATGAA ACAGTTATCG
GGCGAATTCG AGACGGCGCA TCCCGACATC CACCTGAACT GGGTGACGCT GGAGGAAAAC
GTCCTGCGCC AGCGGGCGAC GACCGACATC GCGACCCATT CCGGCCAGTT CGACATCCTG
ACGATCGGCA ATTACGAGGT GCCGATCTGG GCCAAGCAGG GATGGCTGGC AGAACTGCAC
CCCGATGCCG CCTATGATGC GGACGACATC CTGCCGGCGG TACGTGCCGG CCTGACGGTG
AACGACAAGC TGTATGCCCT GCCGTTCTAT GCCGAAAGCG TCATGACCTA TTACCGCAAG
GACCTGTTCG CCAAGGCCGG GCTGACGATG CCCGACGCCC CGACCTATGA CCAGATCCGC
ACCTTCGCCG ACAAGATCAC GGACAAGGAC AGCCAGACCT ATGGCCTGTG CCTGCGCGGC
AAGCCGGGCT GGGGCGAGAA CATGGCCTAT GTCACGTCGC TGGTGAACAC GTTCGGCGGC
CAGTGGTTCG ACATGACCTG GCACCCCATG CTGAACAGTC CGGAATGGAA GGCGGCGCTG
ACCTGGTACG TGTCGGCCCT GAAGGCCGAT GGCCCGCCCG GGGCGACATC GAACGGATTC
AATGAAAACC TGGCGCTGTT CGCCAGCGGC CATTGCGGCA TCTGGATCGA TTCCACGGTG
GCCGGCGGCC TGCTGTTCGA TCCGGCGCAA TCACACGTGG CCAATACGGT AGGCTTCGCT
TCCGTGCCGC TGGGGCCATA CGGCAAGGGA CCGACCTGGC TGTGGAGCTG GAACCTGGCC
ATTCCCGCAT CGTCCACCCA TGTGGCCGAC GCCCAGACCT TCATCACCTG GGCGACGTCG
AAGGCCTATG TGCAACTGGT GGCGAAGAAC CGCGGTTGGG TCGCAGTGCC CGCCGGCACG
CGCCTGTCGA CCTACAACAC CCCGGAATAC CAGAAGGCAG CGCCGTTCGC GGCGTTCGTC
CATAACGCCA TCGACCATGC CGATCCGAAC GGGCCGACGA AGCAGCCGCG CCCCTATGGC
GGCGCGCAGT TCGTCGCCAT CCCGCAGTTC CAGGCCATCG GCACCCAGGT CGGCCAGAGC
GTCGCCGCCG CCCTGTCGGG CCAGACGACG GTCGAGCAAA CCCAGGCCTC GGCCCAGGCG
TTGGTGACAC GCACCATGCG GCAGGCAGGC CTGCTGCATT AG
 
Protein sequence
MAHSTTSRRW SGASMQILGM LVLGTASCLG MGAASPAHAA GTLTIATVNN ADMVVMKQLS 
GEFETAHPDI HLNWVTLEEN VLRQRATTDI ATHSGQFDIL TIGNYEVPIW AKQGWLAELH
PDAAYDADDI LPAVRAGLTV NDKLYALPFY AESVMTYYRK DLFAKAGLTM PDAPTYDQIR
TFADKITDKD SQTYGLCLRG KPGWGENMAY VTSLVNTFGG QWFDMTWHPM LNSPEWKAAL
TWYVSALKAD GPPGATSNGF NENLALFASG HCGIWIDSTV AGGLLFDPAQ SHVANTVGFA
SVPLGPYGKG PTWLWSWNLA IPASSTHVAD AQTFITWATS KAYVQLVAKN RGWVAVPAGT
RLSTYNTPEY QKAAPFAAFV HNAIDHADPN GPTKQPRPYG GAQFVAIPQF QAIGTQVGQS
VAAALSGQTT VEQTQASAQA LVTRTMRQAG LLH
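
The sequence blocks above are printed in groups of ten characters, so they need to be cleaned before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be joined, its GC fraction computed, and its translation compared with the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares with table 1; alternative start codons are not handled here.

```python
from itertools import product

# Standard codon assignments (shared by translation tables 1 and 11),
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = clean(
    "ATGGCGCATT CCACGACTTC GCGGCGCTGG TCCGGTGCCT CGATGCAGAT TCTCGGCATG"
)
print(translate(first_line))  # prints: MAHSTTSRRWSGASMQILGM
```

Passing the full cleaned gene sequence to `translate` and comparing the result with the cleaned protein sequence would check the two blocks against each other, and `gc_fraction` on the same input can be compared with the listed GC content.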