Gene Gdia_1456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1456 
Symbol 
ID6974864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1625672 
End bp1627972 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content65% 
IMG OID643390986 
ProductTonB-dependent receptor 
Protein accessionYP_002275850 
Protein GI209543621 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.271102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTTC CCCGCAGCAC CCGTTTTGCC CTGTTGAGCC TGCTGTCGTC AGGGATGGCA 
TCCGTGACAT TGCCGGCCTG GGCCGGAACG CCGGATGCGC CGGCCCCGGC AGGAACCGGG
CCGGCCGACG CCCGGCACAC CGCACCCGGG CAGACGCAGG CACGGCCGGC CGCAACGACG
GTGACCACCC GTGCGGCACC GGGCCAGCCC TCCGCCTCGG CGCCCGAATC GATCACCGTC
ACCGCACGGC TGGACCGGGC ACGGGCCGCA TTGCAGCCTT CGACCGGGGC CACGGTCTAC
AGCTTCAGCC GCAACGCCAT CGAGACCGTG CCGGGCGGCG ACAACGCGCC GCTGAACAGC
GTGCTGCTGC AGGCCCCCGG CGTGGCGCAG GACAGTTACG GCCAGATCCA TGTGCGCGGC
GACCATAACG AGGTCCAGTT CCGGCTGGAC GGCGTGCAAT TGCCCGAAGG GCTGACCGTG
TTCGGCCAGA CGCTGATGAC GCGCTTCGCC GATTCCATGA CCATGACGAC CGGCGCGCTG
CCCAGCGAAT ACGGGTTCCT GCAGGCGGCG GTGATCGACA TCACGACCAA GAACGGCACC
ACCGATCCGG GGGGCGAGGC CTCGATCTAT GGCGGCGCGC GGGACTATTT CTTTCCGTCC
CTGCAATATG GCGGACATAG CGGAAAATGG GATTATTTCG CCACCGGGGA TTTCGTGCAT
GACCGGGTGG GGATCGAGAA CACCACCTCC AGCTTCAACG CGCTGCACGA CCTGACGAAC
CAGTACCATT TCCTGGGGCA CCTGCGCTAT ACCGCCGACG ACGATACGCG CATCAGCCTG
ATTGCCGGGG TATCGAACGC GGAATACCAG CTTCCGAACA ATCCCGGTCA GCAGACGCAG
TTTCAGCAGC CGTCCTTCGC CGGCAGCGCA CTGGCACAGC AACTGGCCGG CAGCGTCGAC
AGCGCCAGCC TGGACGAACG CCAGCGCGAG ATCACGGATT TCGCCATCCT GTCGCTGCAG
AAGGAAATCG GACGGTTCAG CCTGCAAAGT TCGGCTTTCC TGCGCTACAG CAGCCTGGAT
TATTCGCCCG ACATGCTGGG GGACCTGGTC TATAACGGCA TCGCGCAGCA GGCGTCGCGG
TCGGTGTTCT CGTCAGGGAC GCAAAGCGAC GTGACCTGGC GGGTGGCGCC CCGGCATACC
GTGCGCTTCG GCTACCAGGT GTTCGTCGAA CGGAACGTGT CGCAGACGGA TTCTCTGGTC
TTTCCGCAGA CCGGCACCGA TGCGGCAGGC AATGCCGTGT TCGGCACGAC GCCGGAATCG
ATCCATCAGG GCAGCGGGCT GACGGGGACC ATCTGGGGCC TGTATATCCA GGACGAATGG
AAGCCGCTGC GCAACCTGAC GGTGAATTAC GGGCTGCGGA TGGACGGGGT GGACGAATAC
ACCCACCAGC AGCAACTGAG CCCGCGACTG AACCTGGTCT GGACACCGTG GCGCGGAACC
ACGCTGCATG CCGGATATTC GCGCTATTTC ACGCCGCCGC CCTTCGAAGT GGTCAGCGGC
GCGTCGGTCG CCGCCTTCAA CGGCACCAGC GCGCAGGCGG CAAGTCCGCA GAGCAGCACG
GTCAAGGCCG AAAGCGACCA TTATTTCGAT GCCGGCATCA CGCAGCGCCT GCTGCCGGGG
TGGCAGGTCT CGTTCGATGC CTATTACAAG CTGGCGCACA ACCTGATCGA CGAGGGCCAG
TTCGGCGCGC CGATCATCCT GTCGGGCTTC AATTACCGGC GCGGACAGGT GAACGGCTAT
GAACTGGCCA CGTCATACGA TCGCGGGCCG CTGTCGCTGT ACGGCAACAT GGCGTGGTCG
CGGGCGATCG GAAAGGACAT CACCAGCGCG CAGTTCAATT TCTCGCCCGA CGACCTGGCC
TATATCCAGC ATCGCTGGAT CTACCTGGAC CATGACCAGC GCTGGACGGC GTCGGCAGGG
GCGTCGTACA GCTTCTTCCA CCGCACCGGC CACCCGACCA GGCTGTCCGC CACCATGGTC
TATGGCAGCG GCCTGCGCGC CGACGGCGAC GTGCCCAACG GGGTCAAGCT GCCGCAATAC
GTGACGTTCA ACCTGTCGCT GGTCCAGTCG TTCCAGGACC TGTTCCACGT GCCGTTCCTG
AAACGCACGC AACTGCGGCT GGACGTCATC AACCTGTTCG ACCGGACGTA CGAACTGCGC
GATGGAACCG GAATCGGCGT GGGCGCGCCG CAATACGGGT TGCGCCGCAC GATCCTGACC
GGGATTTCGC AGCGGTTCTG A
 
Protein sequence
MPLPRSTRFA LLSLLSSGMA SVTLPAWAGT PDAPAPAGTG PADARHTAPG QTQARPAATT 
VTTRAAPGQP SASAPESITV TARLDRARAA LQPSTGATVY SFSRNAIETV PGGDNAPLNS
VLLQAPGVAQ DSYGQIHVRG DHNEVQFRLD GVQLPEGLTV FGQTLMTRFA DSMTMTTGAL
PSEYGFLQAA VIDITTKNGT TDPGGEASIY GGARDYFFPS LQYGGHSGKW DYFATGDFVH
DRVGIENTTS SFNALHDLTN QYHFLGHLRY TADDDTRISL IAGVSNAEYQ LPNNPGQQTQ
FQQPSFAGSA LAQQLAGSVD SASLDERQRE ITDFAILSLQ KEIGRFSLQS SAFLRYSSLD
YSPDMLGDLV YNGIAQQASR SVFSSGTQSD VTWRVAPRHT VRFGYQVFVE RNVSQTDSLV
FPQTGTDAAG NAVFGTTPES IHQGSGLTGT IWGLYIQDEW KPLRNLTVNY GLRMDGVDEY
THQQQLSPRL NLVWTPWRGT TLHAGYSRYF TPPPFEVVSG ASVAAFNGTS AQAASPQSST
VKAESDHYFD AGITQRLLPG WQVSFDAYYK LAHNLIDEGQ FGAPIILSGF NYRRGQVNGY
ELATSYDRGP LSLYGNMAWS RAIGKDITSA QFNFSPDDLA YIQHRWIYLD HDQRWTASAG
ASYSFFHRTG HPTRLSATMV YGSGLRADGD VPNGVKLPQY VTFNLSLVQS FQDLFHVPFL
KRTQLRLDVI NLFDRTYELR DGTGIGVGAP QYGLRRTILT GISQRF