Gene Gdia_1115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1115 
Symbol 
ID6974519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1252199 
End bp1253629 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content62% 
IMG OID643390644 
ProductCarbohydrate-selective porin OprB 
Protein accessionYP_002275513 
Protein GI209543284 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3659] Carbohydrate-selective porin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.967056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.588415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCA CTTTTTCGCG CACGCGCCCC AGCCTTGCCT GGCGCATCTG GATCATGGCG 
GCCGTAGCCG TCCTTTCCTG CCATCCCGCC CGCGCGCAAT ATCGTGGTCC GGTCGCGGAA
ACCGCCCCGT CCTTCTCGCT CGATACCCCC ACATCGTACG CGAACACCCC GTTCACCCCG
CCGGTTGAAC ACATGGTTTC AGCCTGGGGC AACGCCGTCC AGAACCTCAA CCGCAAGGGA
ATCGGCCTGG TCATCGACTA TACCAGCGAA AGCGCACTGG CGCTCGATGC CGGCAATGCC
GGGGACGCAG GCTATGCACA CCAGATCGGC GTACAGTTGG ACCTGGATTG GGACAAACTG
GTGGGCTGGC GCGGCTTTGT GACCCATGCG GCCATCGTCA ATCGGGCGGG CCATAACATG
GCCGCCGATT TCGGTGACAG GTCACTCAAC GGATTCCAGG AGATCTATGG CGGCGGCGGC
AATACGGCCG TTCACATGGT CTATGTCTAT GGCACGCAGA ACCTGTTCCA CGACCGCGTG
CAGATCGCGA TCGGCAAGCT GCCGGTCAAT ATCGACTTTT CCGCGTCCCC CCTGTTCTGC
ACGTTCATGA ACAAATCCAT GTGCGGAAAC CCCAAATCCC TGACCCGCGG CGCCGCGGGT
TTCGGCACCT ATCCGGGTTC GACCTGGGGG ACGCGCGTAC GCTACTGGCC CATGCACGGG
GTCTACGCGC AGGCCGGACT GTACGGCGTC AATCCGGACC TCAATACCAA TCGGTATGAC
CGCACCGGAT TCAACTTCAA CACGAATCTC TATACCGGCG TCTACGTTCC GGTCGAGGTC
GGCCTGATCC CGTCCTTCGG CAGGAACCAG CTTGTCGGTC ACTACAAGGT CGGCGTCGCC
TACGATTCCT CGAACTACGC CGACAATTAC TACGATGTTA ACGGCGCCCC CCTGGCGCTG
ACGCGCAGGG CCGCGCGGAT GGACACCGGC AAGACGCAGC TCTGGATCGA AGGCGACCAG
ATGCTGATCC GCAACGGCCA TGGCCCGCTC AACGGATTCT ATGTCATGGC CGGCCTGGTG
CGTAACACGC CGGAAAGCAG CCCGTACCTC TATCAATATT ATTTCGGGAT CGTGGACCGG
GGCTTCTGGC GCGCGCGCCC CGACGACACG TTCGGCATCG AGGTCTCGCG GGCCACGGCC
AGTCCGGACC TTGTCGATAC GCAATGGCTC GATTACGCAG CGGGGCGCAA GCTGCCGGCC
AATGCCACCT ATCCGCAAAG CCATATCAGC GTGCTGGAAG CCACCTACAA CATCCACGTC
TGCGAGGGGC TCTCGATCCA GCCGGACTAC CAGCGGATCA TGCGGCCCAA CCTGCAGCGC
AACAAACCCG CGATCGACGC GATCGGCCTG AAGATCCACG CGACGCTCTG A
 
Protein sequence
MRITFSRTRP SLAWRIWIMA AVAVLSCHPA RAQYRGPVAE TAPSFSLDTP TSYANTPFTP 
PVEHMVSAWG NAVQNLNRKG IGLVIDYTSE SALALDAGNA GDAGYAHQIG VQLDLDWDKL
VGWRGFVTHA AIVNRAGHNM AADFGDRSLN GFQEIYGGGG NTAVHMVYVY GTQNLFHDRV
QIAIGKLPVN IDFSASPLFC TFMNKSMCGN PKSLTRGAAG FGTYPGSTWG TRVRYWPMHG
VYAQAGLYGV NPDLNTNRYD RTGFNFNTNL YTGVYVPVEV GLIPSFGRNQ LVGHYKVGVA
YDSSNYADNY YDVNGAPLAL TRRAARMDTG KTQLWIEGDQ MLIRNGHGPL NGFYVMAGLV
RNTPESSPYL YQYYFGIVDR GFWRARPDDT FGIEVSRATA SPDLVDTQWL DYAAGRKLPA
NATYPQSHIS VLEATYNIHV CEGLSIQPDY QRIMRPNLQR NKPAIDAIGL KIHATL