Gene Gdia_2311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2311 
Symbol 
ID6975741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2561528 
End bp2563216 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content69% 
IMG OID643391839 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002276681 
Protein GI209544452 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.34367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0804809 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGGTC GCGCCGACCG CCGGACGCCC GTTCCTGCCC CGTTCCCGCC CAGGAGGCCG 
ACGCTGAGAC CCGATTGCTT TCCGGTGCTT TTCGCTGTTG CGGCCGTCGC GGCGGCAGGC
CTGCCGTCCG CGCACGCGCA GGGCCCGCTT CGGGGCGGGG AACTGATCTA TCTGGACCCC
CAGGCGCATA CCAACCTGTA TCCCCCGGCG GCCGGCTTCT ATCCCAACGG CGGAATCCTG
GACCAGGTCA CCGACCGGCT GACCTGGCAG AACCCGCAGA CGCTGGAAAT CGAGCCCTGG
CTGGCGCAAT CCTGGTCCAG CAACGCCGAC GATACCGAAT ACACCTTCCA CCTGCGCCCG
GGGGTGACGT TCTCGGACGG CACTCCGCTC GACGCGCGGG CCGTCGCCCT GAATTACGAA
ACCTACGGCA AGGGAAACCC GGCGCTTCAT TTCCCGGTTT CCGAGGTCAT TAACAATTTC
GACCATGCCG AAATCCTCGA CCCGCTGACC GTCCGGTTTC ATTTCACCCG TCCGTCGCCG
GGCTTTCTCC AAGGCACGTC TGTCATCGGC TCGGGGATCG TCTCGCCCGC CACCCTGGCG
CATCCGTTCG ACCAACTGGG CGTCGGCACG CAGGTCGTCG GCTCGGGCCC GTTCGTGATC
GTCCGCGACG TTCCGGGCAA GGAGGTCGAT CTGGTGGCGC GGCGCGACTA TGCCTGGCCC
CCGGCCTCGC GCGCGGGGCA GACGCGCGCG TGGCTGGACG GCGTGAAGAT CCTGGTAACG
CCCGAGGACA GTATCCGCGT CGGCGCGCTG CTGGCGGGCC AGGCCGACCT GATCCGCCAG
GTCGAGGCCT ATGACGAGGA ACAGGTCACC CTGGCCGGCT ATCGCCTCTA TGCCCCCTCG
ACGCGCGGGG TGAACACGGG GATCGCCTTC CGGCCGGACA ATCCCCTGGT GGCGGATATC
CGGGTGCGCG AGGCCCTGCT CCACGCCACC GACCGGCAGG AAATCGTCAC CACGCTCTAT
TCCGCCAACT ACCCCCTCGC GCGGTCGGTG CTGTCGGCCC GGGCCGCCGG CTTCCGCGAC
CTGTCCGACC GGCTGGGCTT CGATCCGCGG CGCGCCGCCC AATTGCTGGA CGAAGCGGGC
TGGCGCCTGG GGCCGGACGG GCTGCGCCAT CGGGACGGGC AGACGCTCGC GCTGGGCATC
CACATCTCGC AGCCGCATCC CCAGAACAGG ACGATGCTGG AACTGCTGGC CCAGCAATGG
CGCAGGGTCG GCGTCCAGCT GACCGTCATG TCCGGTTCGG CGGCGGGCGT CATCCTCGAC
AATCTGGACC CGACGCGCAC CCCCGTCACG GTGTCCGAGG TCGGGCGCGC GGACCCGGAC
GTGATGAAGA GCGAATTCTT CCCGTCCAAC CGGGACACGC TTCTGCAAAA GGGCGGCCAG
AGCGCCAAGG TGCGCGCATT CCGTGACGAC CGGCTGGACG CGATGCTGCT GCAGGTCGCC
TCGGACACCG ACCGGCAGGA CCGGCTGCGG CACCTGGGCG ACGTTCAGGA GTATATCGTG
CAAAACGCCT ATACGATCCC GATCTTCGAG GAACCCCAGG TCTATGCCGG GGCGCCCGGC
GTCCACGGCG TCGGCTTCGA GGCCGTGGGC CGCCCCAGCT TCTACGGCAT ATGGCTGGAC
CGGCGATGA
 
Protein sequence
MCGRADRRTP VPAPFPPRRP TLRPDCFPVL FAVAAVAAAG LPSAHAQGPL RGGELIYLDP 
QAHTNLYPPA AGFYPNGGIL DQVTDRLTWQ NPQTLEIEPW LAQSWSSNAD DTEYTFHLRP
GVTFSDGTPL DARAVALNYE TYGKGNPALH FPVSEVINNF DHAEILDPLT VRFHFTRPSP
GFLQGTSVIG SGIVSPATLA HPFDQLGVGT QVVGSGPFVI VRDVPGKEVD LVARRDYAWP
PASRAGQTRA WLDGVKILVT PEDSIRVGAL LAGQADLIRQ VEAYDEEQVT LAGYRLYAPS
TRGVNTGIAF RPDNPLVADI RVREALLHAT DRQEIVTTLY SANYPLARSV LSARAAGFRD
LSDRLGFDPR RAAQLLDEAG WRLGPDGLRH RDGQTLALGI HISQPHPQNR TMLELLAQQW
RRVGVQLTVM SGSAAGVILD NLDPTRTPVT VSEVGRADPD VMKSEFFPSN RDTLLQKGGQ
SAKVRAFRDD RLDAMLLQVA SDTDRQDRLR HLGDVQEYIV QNAYTIPIFE EPQVYAGAPG
VHGVGFEAVG RPSFYGIWLD RR