Gene Gdia_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2044 
Symbol 
ID6975471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2264245 
End bp2266575 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content64% 
IMG OID643391574 
ProductTonB-dependent receptor 
Protein accessionYP_002276419 
Protein GI209544190 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4774] Outer membrane receptor for monomeric catechols 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGTCC ACAGGAAATA TCGCGTCGAA AAAATCCGGG GGACCGTTTC CAATCTTACC 
CTCTGCGGTA CCACCCTGCT GCTGGGGGCC CAGTCCGTCG CCCATGCTGC CGACACGCTG
AAGCGCGACG AGAAAAAGAA GAAGACGCAC GACCCCGCGA TCCACCAGCA GGGTCGTCCG
CTTCCGGCGC ACGAGGAAAT CGACGTCGTC GGGCAGTGGC AGCACCGGTC GCCCAAATTC
ACCGCCCCGC TGCTGGACAC GCCCAAGAGC GTGAAGATCA TTTCACGCGA ATTGCTGGAC
CAGACCGCAT CCCACACGCT GGTCGACGCC CTGCGCAACG TCCCGGGCAT CACGATGGGG
GCGGGCGAAG GCGGGAACCC GGTCGGCGAC CGGCCGTTCC TGCGCGGTTT CGATGCGCAG
TCCAGCACGT TCGTCGACGG CCTTCGCGAC GTCGGCGCGC AATCGCGCGA GACGTTCAAC
GTCGAGGCCG TGGAGGTCAT CAAGGGCGAT TCCGGCGCGA TCAGCGGGCG GGCGGGTGCC
GGCGGGTCCA TCGTGATCGA CAGCAAGATG CCCAGGCTGA AGAACTCGCT CGATGCCAGC
CTGGGCTTCG GCAATGCCGG TTACAAGCGC GGGACGCTGG ACGGCAACTG GCAGTTTTCG
CGGACCGGCG CCTTCCGCCT GAACCTGATG GGCGACGACG AAAACAAGGC TGGTCGGGGA
CCCACAGAAT TCAAGCGGTA TGGGGTGGCG CCCTCGGTCT CGCTGGGGCT GGGCACGCCC
GACCGGGTCA CCCTGATGTA TTACCACATG CAGAACGACG ACCTGCCCGA TGTCGGCATT
CCCTACGACA ATCCCACCTT CAACGCCCGC ACCGACGGCG CCCCGCGCCT GATGACGGCC
GGCAACGGCG CGCCCGTCAA GGTCCCGTTC GACACCTGGT ACGGCCTGGT CAACCGCGAT
TCCGACCAGG ATTCGATCGA TATGGGCACG CTGCGGCTGG AGCACGATTT CAACAGCCAC
CTGCATATCC GCAACACGAC GCGCTATTCC GAGACCAGCC AGAACGACCT GTGGACCATG
CCGGACGACA GCCAGGGCAA CATCTATTAC GGCTATGTCT ACCGGCGTCT CAACAGCCGC
GTCTCGACGT TCGATACGGC CATAAACCAG ACCGACTTCT ATGGAACGGT GTCCCTCCTG
GGGTTCCGCA ACCAGTTTTC GACCGGCATG GAATTCACGC GCGAACAGGG CAAGAACGAT
ACCGACACGG CTTATGTGAA CGGCGTCAAC GTCGCGTCCG GCACGAATTT CACGCATTGC
GCCACGGCGC TCGCCTTCAC GTCCCATACC TGTACAACGC TTTCCAATCC CAATCCCAAC
GATATGTGGA CCGGCGATAT CCGCCGGACG GGAAACCCCA ACAGCACGCG CATGGATACC
AAGGCGGTCT ACCTGACCGA CACGGTCACC TTCATGCCGC AGCTTCTGGG AAATTTCGGC
GTCCGGCTGG ACAATGTCCA AAGCACGTAC CGCGCGCAGG CGGGGGAATA CGGGCGCGGG
GACAACCTGT TCACCTACCA GGGCGGCCTG GTCTACAAGC CCGCCCGCAA CGGATCGATC
TATGCGTCCT ACGCGACGTC GGCGATTCCG CCGGGCAACT CGGTGGGACA GGGGGCGGAC
GACATCAGCC TCGGCGCGGA CCGGACCGGC AATATCGGCA GCGTGCTGAA GCCGGAACGC
GACCGCACGA TCGAGGTCGG CACCAAATGG AACGTCCTGC ATGACAGGGT GACCCTGACG
GGCGCGCTGT TCCAGATCGA TACGACGAAT TTCAAGATCG CCACCGCCTC GGGCGGGATC
TCGAACGGCG GCAGCAAGCG CACGCGCGGG GGCGAGGTCG GCGTTTCGGG CCACATCACC
CGCGACTGGT CCATGACGGC GGGCTACAGC TATCTGGACG CGCGCCTGGT GCAGGCGGGC
GGCAGCGGTG CCGCCGCCGG CCTGATGAAC GGACGCCGCG CGCCGAACAC GCCGGAAAAC
AGCCTGGCAT TGTGGAACAC CTACGACATC ACCCCGGGGT TCAAGGTCGG TTCAGGCGTC
TATTACATGG GCAAGGTCTA TGGCGCGGAT TCGCCCACGG CGCCGAAATA CGTGCCCGGC
TACTGGCGGT TCGACATCAT GGCCAGCTAC CGCTTCCTGA AGCACTACAC GCTTCAGCTC
AACATCCAGA ACCTGACGAA CAAGCGGTAT TTCACCCAGG CCTACGTGAC CCATTACGCG
CTGCAGGCCG CGGGCCGCAC GGCCTTCGTC ACGCTGAATG CGCATTTCTG A
 
Protein sequence
MIVHRKYRVE KIRGTVSNLT LCGTTLLLGA QSVAHAADTL KRDEKKKKTH DPAIHQQGRP 
LPAHEEIDVV GQWQHRSPKF TAPLLDTPKS VKIISRELLD QTASHTLVDA LRNVPGITMG
AGEGGNPVGD RPFLRGFDAQ SSTFVDGLRD VGAQSRETFN VEAVEVIKGD SGAISGRAGA
GGSIVIDSKM PRLKNSLDAS LGFGNAGYKR GTLDGNWQFS RTGAFRLNLM GDDENKAGRG
PTEFKRYGVA PSVSLGLGTP DRVTLMYYHM QNDDLPDVGI PYDNPTFNAR TDGAPRLMTA
GNGAPVKVPF DTWYGLVNRD SDQDSIDMGT LRLEHDFNSH LHIRNTTRYS ETSQNDLWTM
PDDSQGNIYY GYVYRRLNSR VSTFDTAINQ TDFYGTVSLL GFRNQFSTGM EFTREQGKND
TDTAYVNGVN VASGTNFTHC ATALAFTSHT CTTLSNPNPN DMWTGDIRRT GNPNSTRMDT
KAVYLTDTVT FMPQLLGNFG VRLDNVQSTY RAQAGEYGRG DNLFTYQGGL VYKPARNGSI
YASYATSAIP PGNSVGQGAD DISLGADRTG NIGSVLKPER DRTIEVGTKW NVLHDRVTLT
GALFQIDTTN FKIATASGGI SNGGSKRTRG GEVGVSGHIT RDWSMTAGYS YLDARLVQAG
GSGAAAGLMN GRRAPNTPEN SLALWNTYDI TPGFKVGSGV YYMGKVYGAD SPTAPKYVPG
YWRFDIMASY RFLKHYTLQL NIQNLTNKRY FTQAYVTHYA LQAAGRTAFV TLNAHF