Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2169 |
Symbol | |
ID | 6975597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 2403707 |
End bp | 2406901 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643391698 |
Product | TonB-dependent receptor |
Protein accession | YP_002276542 |
Protein GI | 209544313 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4771] Outer membrane receptor for ferrienterochelin and colicins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.576978 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACCAA CCGGTGAGAC GCCGCCCCGC GCACCGGGCG CGGAACGCAA GCGACGACGC CCTTCGCCCT TCCTCGTAAT CCAACCCGCG CTGCGCAACC GCGGGCGCAT GCTGCTGTGC GCCACGGCGA TCGCCGCGAC GCCCGCCCTG GCCTCGGCCC AGACGGCTGC GCCCGCCACG ACGGCGACGC CGACCCACCA CCACAAGACC ACGACCCGTC ACACGACGGC CAAGGCGAAG ACGGTCACCC CGACCCGTGC GGCCCCCGCC GCCGTGGCTC CGGCGGCAGC CATCCCGCCG GCAGGGGTTC CCGCGACCAC GTCTTCCGCC CGCAACGCGT CGATCATCGC GGGCGCGGCC AATGCCACGT CTGCGCCGGA CAACGCGCCG GCGGCTGAAA ACGTGATCGT GACCGGCACG CTGTTCCGCG ATCCGAACCT GGCCAGCGCC TCGCCCATTA CCGAGGTGAC GCGCAAGGAC ATGGCCGCGC GCGGGCTGAA AACCGTGACC GACGCGCTGC AGGTCCTGTC CGCCAATGGC GCGGGCAACC TGACGAACTC GTTCTCCGCC TACGGCGCAT TTGCCGGCGG TGCCTCCGCC CCCTCGCTGC GCGGCATGAG CACGGACTCG ACCCTGGTTC TGATGGACGG GCAGCGTCTG TCCTACTACC CGCTGGCCGA TGACGGCGAG CGTAACTTCG TCGACACCAA CTGGATGCCG TCCTCGATCA TGGATCGGGT GGACGTGATG CAGGATGGTG GCTCCGCCAC CTATGGTGCC GATGCCGTGG CTGGCGTCGT CAACTACATC ACGCGCAAGC AGATCAAGGG CTTCGAAGGT AATGCCGAAG GCGGCCTGAG CCAGCGTGGC GATGCCGGTC ACCAGAAGCT GTATGCCACC TATGGCTGGG GCGATCTGGA CCGCGACGGC TGGAACTTCT ACCTCAACTC CGAATACCAG CAGGATGACG CGCTGTACAA CCGTCAGCTC GGCTATCCGT ACAATACAGG CGACCTCAGC GGGCTCGCCG GCGGGTATAA CGGAAACACC AACGCCCCCG GATCGACGAT CAACAACTTC GGCGCCACGC CGACGGCCGT CGTCTCGCCC ATCGGATTTA CGCCGACTTC AACCGGCGGC AGCATTCCGA GCAACACGGG CCCCTGGCAA CTGCTGAACC CGAGCGCCGG GTGCACGGGC GCGGGCGTCA TCGGCAAGGT CAGCGGCTCT GTGTCGGGTG CGCCGGGCGT CAGCACGACC TGCACGCAGA ACGCGGTCGG TGCCTACAAG CAAATCCAGC CCGAACTCCG TCGCATCAAC GCCACGGCGC ATTTCATCGC CAACGTGACG CCGCGTTCGC AGTTCGTGAC GATGTTCAAC TACTCGCAGG TCCAGTCGAG CTACAACGCG AATCCGCCCT ATTCGACCCT GGCATCGACC CCCTATCTGA CCGCCACGGC GCAGAACACC TACCTGCCGA GCACGAGCCC CTACAACCCG TACGGGCAGG CGGCACAGGT CCTGGCCACG TACGGCGGAC TGCAGCCGTT CACCACGGAG TTCAGCCAGA ACTTCCGCGG ATCGATGCGC TATTCCGGCT GGGCCCCGTC AAAATGGGGT TCGAACTGGA ACTACGACAT CAACTTCGTC GGCATGAACA CGGTCCTGCA GCAGGTGGAT ACCGGTTTCC CGACGATCAG CGGCATCGAG AATTCGATCA CGAGCGGCAG CTATAACTTC GCCAATCCGT CCGCGAATTC CAAGAGTGAA CTGAATTCCA TCGCGCCCCG CAACGTGCTG AATGCCCGTA CGCAGGAATA TTCGGAAGAC ATGCATGTCA GCAAGGGGCT GTTCAAGCTT CCGGGCGGCA TGGTCAATGT CGCTATCGGC GGCAACATCC GCTGGGAATC GGTCAACGAC CCCAATGCCA ACCCGCAGAC GGCCAATCCT GCGAACGAAT GGGCCGGCAT CAATCCCTTC AGCGCCAAGG GCAGCCGCTG GGTTGAATCG GGCTACTTCG AAGTGGGTCT GCCGATCATC AAGATGCTGA ACGCGGACAT CTCCGGCCGC TACGACAACT ACTCCACCGG AATGCATCAC TTCTCGCCGA AGGTTGGCGT GAACTTCAAG CCGGTGAAGC AGTTTGCGCT TCGCGGCACG TTCTCGCGCG GCTTCCGCGT GCCCAGCTTT GCTGAAACCA GCGGCTCCGT CCTCGGCTTC ACGGGGTACA CGCCGAACAG CTCCATTCCG GGCATCGCAG CCTGGCAGGC GACGCACGGC AACGACGGCT ATGCAACGAA TCCGTATTCC ATCGGCGTCA ACACGGTGGG CAATCCCAAC CTGAAGCCGG AAATTTCGAC CAACTTCACC GGCGGGGCGG TGATCAGCCC GCTGGACTGG CTGCACCTGT CGGCCGACTA TTACTACATC AAAAAAACCA ACTACATCAT GGCGAACCCG TACGTGAATC CCGTCGCGGC GGCCAATGAC TACATCCTGG GCGAAGCTCT ACCGTCGGGC ATCCTGTCGG CCACTCCGTC CATTGCGGAT ATCCAGGCTC CTGGCGCGAA GGCATCTCCG GGTATCTTCA CTGACCAGTA CATGAACGCC CGCAGCTACA TGACCAATGG TGTGGATCTG TCGATCGACG CCACCCGTCA TCTGCCCGGT CCGCTGCATG ACGTGCTGTG GTTCAGTAAG GGCACGGCCA CGTATGTGCA TGCGTCCAAC CTGACGCTGC CGGACGGCAG CGTCTATCAC TACGCCGGGA CGATTGGTCC GTACGAGCAG GTGTCGGCCT CTGGCACACC GCGTTGGAAG GCCAGTTGGT CGAACACCTT CAGCTGGAAG GGTCTGGGCG TCACGCCGAC GGTCTATTAC ACCAGTGGCT ACAAGACCAC CGCTGAAGAC CAGAACGGCG CCAACACCAA CACGTGCGCA TACACCCTGT CGGGCTACGG CGCGACCGGT GCCCCCAGCA CCCAGTGCCA TGTCAAAAAC TGGTGGGACG TTGACCTGAC GGTCAACTAC CAGATCAACC CGCGCTGGTC GGTCTATGCC AACGTCTATA ACCTGCTGGG CTTCCGCTCG CCGTACGATT ACGCGACCTA TGGTTCCTAC CTGTACAACT CTTCCTGGAC GCAGAAGGGC ATCATCATGC GGTCCTTCCA GTTCGGTGTG AACGTCCGTC TCTGA
|
Protein sequence | MAPTGETPPR APGAERKRRR PSPFLVIQPA LRNRGRMLLC ATAIAATPAL ASAQTAAPAT TATPTHHHKT TTRHTTAKAK TVTPTRAAPA AVAPAAAIPP AGVPATTSSA RNASIIAGAA NATSAPDNAP AAENVIVTGT LFRDPNLASA SPITEVTRKD MAARGLKTVT DALQVLSANG AGNLTNSFSA YGAFAGGASA PSLRGMSTDS TLVLMDGQRL SYYPLADDGE RNFVDTNWMP SSIMDRVDVM QDGGSATYGA DAVAGVVNYI TRKQIKGFEG NAEGGLSQRG DAGHQKLYAT YGWGDLDRDG WNFYLNSEYQ QDDALYNRQL GYPYNTGDLS GLAGGYNGNT NAPGSTINNF GATPTAVVSP IGFTPTSTGG SIPSNTGPWQ LLNPSAGCTG AGVIGKVSGS VSGAPGVSTT CTQNAVGAYK QIQPELRRIN ATAHFIANVT PRSQFVTMFN YSQVQSSYNA NPPYSTLAST PYLTATAQNT YLPSTSPYNP YGQAAQVLAT YGGLQPFTTE FSQNFRGSMR YSGWAPSKWG SNWNYDINFV GMNTVLQQVD TGFPTISGIE NSITSGSYNF ANPSANSKSE LNSIAPRNVL NARTQEYSED MHVSKGLFKL PGGMVNVAIG GNIRWESVND PNANPQTANP ANEWAGINPF SAKGSRWVES GYFEVGLPII KMLNADISGR YDNYSTGMHH FSPKVGVNFK PVKQFALRGT FSRGFRVPSF AETSGSVLGF TGYTPNSSIP GIAAWQATHG NDGYATNPYS IGVNTVGNPN LKPEISTNFT GGAVISPLDW LHLSADYYYI KKTNYIMANP YVNPVAAAND YILGEALPSG ILSATPSIAD IQAPGAKASP GIFTDQYMNA RSYMTNGVDL SIDATRHLPG PLHDVLWFSK GTATYVHASN LTLPDGSVYH YAGTIGPYEQ VSASGTPRWK ASWSNTFSWK GLGVTPTVYY TSGYKTTAED QNGANTNTCA YTLSGYGATG APSTQCHVKN WWDVDLTVNY QINPRWSVYA NVYNLLGFRS PYDYATYGSY LYNSSWTQKG IIMRSFQFGV NVRL
|
| |