Gene Gdia_1838 details


Gene Information

Locus tag: Gdia_1838
Symbol:
ID: 6975260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2042585
End bp: 2044975
Gene Length: 2391 bp
Protein Length: 796 aa
Translation table: 11
GC content: 65%
IMG OID: 643391363
Product: TonB-dependent receptor
Protein accession: YP_002276213
Protein GI: 209543984
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
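
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2042585, 2044975)
print(length, protein_length_aa(length))  # 2391 796
```

Both values match the Gene Length and Protein Length fields listed for this record.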


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGGTC ATCCACGGCC CGCGCGCCCA ACGCACAGGG CCGGTCCGCG CCGCGTAGCG 
CTAGCCGTTT TCGTGCTTTC CATGCATGGC GCGGCCGTAC CCGCCCAATC GGCGGCCACG
CCCCTCCCCT CCAACCGCAC CTCCAGCAGG ACACCGGTCG CGCCGGCACA GAAGATCGAT
CCGCGAACGC ACGCCGCCGC GCCGCCCGCG CCCGCGCCCA AGGCCGAGAC GCTGGTCGTG
ACCCGGCAGG CCACGGCGCC GGACGCCAGG CGCTTCGCGC TGCCGCAGAC CAGCGCGGGC
ATCGACCGGC GGACGATCGA GGCGACCGTC AACATCGTCG ATACCGAAGA CGCGCTGAAA
TACCTGCCCA GCCTGCTGCT GCGCAAACGC AACAACGGCG ACACGCAGGC CACGCTCGAA
ACCCGGACCT GGGGCGTCAA TTCCAGTGCC CGCAGCCTGG TCTATGTCGA CGATGCCCCG
ATCTCGGCCC TGATTTCCAA CAACAACACG AACGGCGCGC CGCGCTGGGG CATGGTGACA
CCCGAGCAGA TCGAACGGAT CGACATGCTC TACGGCCCGT TCGCGGCGGA ATATCCCGGC
AATTCCATGG GCGGCGTCGT GCTGGTCACC ACGCGGATGC CCGACCGTTT CACGGCCACG
GTCAAGCAGA CCGGCAGCGT GCAGACCTAT GATGCCTACC GGACCAGGGG GAACTACGGC
ACCGCGAACA GTGCGGTGAC GATCGGCGAC CGGATCGGCC GCCTGTCCTG GCTGTTCAGC
GCCAACCGGG AGGAAAGCCT GGGCCAGCCG CTGTTCTTCG TCACCAGTTC CGGCATGCCG
GCCGGCACGA CGGGCGGGAT CGCGGCGCTG AGCAAGACGG GGTCGGTCGC AAACGTCATG
GGAGCGGGCG GGCTGCAGCA CAGCACGACC GATAACGTGA CGCTGCGCCT TGCCTATGAT
TTCACGCCAT GGCTGCGCCT GAACTATACG GTCGGATACT GGGACAACCA GACCCGCGCC
CGGTCGCAAT CCTACCTGAC CAATGCCGCC GGTGCCGCCA CCTTCGGCGG TGTCGCGGGC
TTCGCCAACG ACACCTACAC GTATGACGAA CAGCACCTGA TGAACGCGGT CTCGCTGAAG
ACCAGGACGC AGGGCCATTG GGACGGGGAA GCCATCTTCA CGGACTATGA CTACCTGAAG
GACATCCAGC GCAATCCGGC CGGCGTGCTG GGCGGCACGA ACTTCACGCC GAACGGCTAT
ATCGCGCGCA TGGACGGGTC CGGCTGGATG ACGGCGGACG TCAAGGGAAT CTGGCGCCCC
AGCGGCGTCG GCGGCGCGCA TGAACTCAGC TTCGGCGGCC ATCGGGACCA GTACGACCTG
GAAAACCCGA CCTACAACAC CGGCAACTGG GCGTCCTCGC CCGCGGCAGG AAACGGCACG
CTCCATTCCA GCGGGCGTGG AACGACGACG ACCTATGCCC TCTGGGCGCA GGAGGCATGG
AAGTTCGCGC CGGGCTTTAC GCTGACGGTC GGCGGAAGGC TGGAATTCTG GCGGGCCTCC
AACGGCTTCA ACCAGGCCGG CGCCGTCGCC GCCAGCCAGC CGCCGGAACA TTCGACCAAT
TTTTCGCCCA AGGCGACGCT GGCGTGGCGG ATCAACCCGG ACTGGACCGC CAAACTGTCC
TTCGGCGAGG CCTGGCGCTA CCCGACGGTG TCCGAGCTGT ACCAGATCGT CTCGACCGGC
GGAACATATG CCGTCCCCAA CGCCACCCTG CGGCCCGAAC AGGTGTTCTC CAGCGAGGCC
ATGATCGAAC GCCGCACGCA CAGCGGCAGC CTGCGCCTGT CGCTGTTCCA GGAAAACACG
CACAACGCAC TGATCTCGCA AAGCACGTTG CTGAACAACA TCTATACGAC GACGTTCCAG
AACGTGACCG AGGTGCGCAA TCGCGGCGTG GAATTCGTGG CGGAACGGCG CGACCTTCTG
ATCAAGGGCT TCGACCTGTC AAACAGCTTG ACCTATGTCG ATTCCCGCAT TCTCTCCGAC
CCCGGCTTCC AGAGTTCGAC CGGCACCACC GCCGCCGGCA AGCATGTGCC CTATGTGCCG
GACTGGCGCG ACACGGTACA GGCCACGTGG CACGCGACGA AGCGCCTCGA TCTGTCGGCG
GCGCTTCGCT ACCAGGGCCG GATGTATTCC ACGCTGGACA ATACCGACCG TGTCGGCCAC
GTCTTCGGCG CGTTCGATAA ATTCCTTGTT GCGGACGTGC ATGTTCACTG GCACGTCACG
GGACCGCTGA CGTTCGATGC CGGCATCGAT AATATCAACA ATGCCCGGTA TTACGAATAT
CATCCGTTTC CGATGCGGAC GTATGTTGCG GATCTGAAGG CCAGTTTTTA A
 
Protein sequence
MSGHPRPARP THRAGPRRVA LAVFVLSMHG AAVPAQSAAT PLPSNRTSSR TPVAPAQKID 
PRTHAAAPPA PAPKAETLVV TRQATAPDAR RFALPQTSAG IDRRTIEATV NIVDTEDALK
YLPSLLLRKR NNGDTQATLE TRTWGVNSSA RSLVYVDDAP ISALISNNNT NGAPRWGMVT
PEQIERIDML YGPFAAEYPG NSMGGVVLVT TRMPDRFTAT VKQTGSVQTY DAYRTRGNYG
TANSAVTIGD RIGRLSWLFS ANREESLGQP LFFVTSSGMP AGTTGGIAAL SKTGSVANVM
GAGGLQHSTT DNVTLRLAYD FTPWLRLNYT VGYWDNQTRA RSQSYLTNAA GAATFGGVAG
FANDTYTYDE QHLMNAVSLK TRTQGHWDGE AIFTDYDYLK DIQRNPAGVL GGTNFTPNGY
IARMDGSGWM TADVKGIWRP SGVGGAHELS FGGHRDQYDL ENPTYNTGNW ASSPAAGNGT
LHSSGRGTTT TYALWAQEAW KFAPGFTLTV GGRLEFWRAS NGFNQAGAVA ASQPPEHSTN
FSPKATLAWR INPDWTAKLS FGEAWRYPTV SELYQIVSTG GTYAVPNATL RPEQVFSSEA
MIERRTHSGS LRLSLFQENT HNALISQSTL LNNIYTTTFQ NVTEVRNRGV EFVAERRDLL
IKGFDLSNSL TYVDSRILSD PGFQSSTGTT AAGKHVPYVP DWRDTVQATW HATKRLDLSA
ALRYQGRMYS TLDNTDRVGH VFGAFDKFLV ADVHVHWHVT GPLTFDAGID NINNARYYEY
HPFPMRTYVA DLKASF
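
The sequences are printed in space-separated blocks of 10 characters, 60 per line, so they need cleaning before any analysis. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed; the helper names are hypothetical, and the method the database used to derive its reported GC content is not stated in this record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTCAGGTC ATCCACGGCC CGCGCGCCCA ACGCACAGGG CCGGTCCGCG CCGCGTAGCG"
seq = clean_sequence(first_line)
print(len(seq), gc_percent(seq))
```

Passing the full gene sequence through `clean_sequence` should give a string of 2391 bases, matching the Gene Length field.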