Gene Gdia_3297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3297 
Symbol 
ID6976737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3602392 
End bp3604479 
Gene Length2088 bp 
Protein Length695 aa 
Translation table11 
GC content68% 
IMG OID643392808 
ProductOligopeptidase B 
Protein accessionYP_002277639 
Protein GI209545410 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.501121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.104616 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCC CCCCGGTCGC CCGCACGGAC CCCCAGACGA TCACGCAGCT CGGCCGGGTC 
CGCATCGACG AATATGGGTG GATGAAGGAC GAGAACTGGC AGGCCGTCCT GCGCGACCCA
TCGGTGCTGC GCGCGGACAT CGCGGCGCAC CTGCGGGCCG AGAACGCCTA CACCGCCGCC
ATCCTGGCCC CGACCGAATC CCTGCAGGCC ACCATCGCCG CCGAGATGAA GGCCCGCATC
CGCGAGGATG ATACCTATCC CGCGCTGCCG CACGGTCCCT GGCGCTATTA CATGCGCTAC
GCGGCCGGCG CCCAGCACCC GATCCATGCC CGCCACCCCG AGGGCCGGCC CGAGGCCGAG
ACGATCCTGC TGGACGTGGA CCGGATGGCG AAGGAGCACG AGTATTTCGC CGTCGCCACA
GCGACCCACA GCCCCGACCA CCTGCTGTTC GCCCATGCCG AGGACAATCA GGGGTCCGAG
GTCTATCGCG TCGTCATCAG CGACATCGCG CGCGGCACGC CGGTCGGGCC GGCGATCGAA
AACTGCAGCG GCAATTTCGC GTTTTCGTCC GACGCGCGGC ATCTGTTCTG GACCTGGCGG
GACGCGCATG GCCGCCCGAC GAAGATCTTT CGCCGCCGCA TCGGCACCGA CGAGGACGTG
CTGGTCTACG AGGAAACGGA CCCCGGCTTC TTCATCGGGG TCGAGGCCAG CCGCTCCGGA
CGCTGGATCG TCATTTCGGC CAGCAACCAG GACACGTCCG AATCCTGGCT GATCCCCGGC
GGCAATCCCG AGGCGGCGGC CGCGTGCGTC GAGCCGCGCC GCACCGGCGT GCTCTACAGC
CTCAGCCACT GGGGGGACCG CTTCGCGATC CTGACCAACA CCGACGGCGC CGTCGATTTC
AAGCTGATGG AAGCGCCCGA TACCACCCCG GGGCGGGCGC ACTGGCGCGA CCTGGTCCCG
CACCTGCCCG GCCGCTACAT CACCGACTGC ATGGCCTTTT CCGGGCATCT GGTCTGGCGC
GAGCGCCGGG ATGCCAACAC CGCGCTGGTC GTCCGCCGCG TCGACGGGAC CGAGCATGTC
CTGTCATCGG ACGAGGACGC CTATGTCCTG TCCTTCTCGG GGTCCTTCGA ATACGACACG
CGCGAATTGC GCTATGTCTA CCAGTCGCCG ACCACGCCCC GGCAATGGTA CGCCTATGAC
ATGGACAGCC GTACCCGCCA CCTGCTGAAG ACGCAGGAGG TACCGTCGGG ACACGACCCG
CGCGATTACA GATGCTGGCG CCTGACCGCC ACCGCCCTGG ACGGGACGCA GGTGCCGATC
ACCGTGCTGG GGCGGCACGG CACGCCCATC GACGGCTCCG CCCCCCTGCT GCTCTATGGT
TACGGCTCGT ACGGCCATGC GATCGAACCG ACCTTTTCCA CTGGGGTCTT CAGCCTGGTG
GACCGGGGAT GGTTCTACGC CATCGCCCAT GTGCGCGGCG GGTCGGAAAA GGGCTGGAAC
TGGTTCCTGG GCGGACGGGG CCGCAACAAG CCCAACAGCT TCACCGATTT CATCGCCTGC
GCCGAACACC TGATCGCGGA CGGCTTCACC GGGGCGGGAC GGATCGTGAC GGACGGGCGT
TCCGCCGGCG GGATGGTCAT GGGGGCGATC GCCAACATGC GCCCCGACCT GTTCGCGGGA
ATCGTCGCCG TGGTGCCGTT CGTGGACGAG CTGAACACGA TGTCGGACAC CAGCCTGCCG
CTGACGCCGC CGGAATGGCC GGAATGGGGC AATCCGCTGG AAGATGAAGC GGCCTACGAC
CTGATCGCCA GCTATGCCGC CTATGAACAG GTGGCGCCGC GCCCGTATCC GGCGATCCTG
GCCATCGGCG GCCTGTCGGA CCCGCGCGTG ACCTACTGGG AACCGGCGAA ATGGATCGCC
CGGCTGCGCG CGCACACGAC GTCGTCCCGC CCGCTGCTGC TGCGCATCAA CATGGAGGCC
GGGCATGGCG GCGCGTCCGG CCGCTTCGAC GCATTGAAGG AGGCCGCCCT GATCCAGGCC
TTCGCCATCT GGGCGGTGGA TACGACCGAC CATACGAGAA CCGAATGA
 
Protein sequence
MTTPPVARTD PQTITQLGRV RIDEYGWMKD ENWQAVLRDP SVLRADIAAH LRAENAYTAA 
ILAPTESLQA TIAAEMKARI REDDTYPALP HGPWRYYMRY AAGAQHPIHA RHPEGRPEAE
TILLDVDRMA KEHEYFAVAT ATHSPDHLLF AHAEDNQGSE VYRVVISDIA RGTPVGPAIE
NCSGNFAFSS DARHLFWTWR DAHGRPTKIF RRRIGTDEDV LVYEETDPGF FIGVEASRSG
RWIVISASNQ DTSESWLIPG GNPEAAAACV EPRRTGVLYS LSHWGDRFAI LTNTDGAVDF
KLMEAPDTTP GRAHWRDLVP HLPGRYITDC MAFSGHLVWR ERRDANTALV VRRVDGTEHV
LSSDEDAYVL SFSGSFEYDT RELRYVYQSP TTPRQWYAYD MDSRTRHLLK TQEVPSGHDP
RDYRCWRLTA TALDGTQVPI TVLGRHGTPI DGSAPLLLYG YGSYGHAIEP TFSTGVFSLV
DRGWFYAIAH VRGGSEKGWN WFLGGRGRNK PNSFTDFIAC AEHLIADGFT GAGRIVTDGR
SAGGMVMGAI ANMRPDLFAG IVAVVPFVDE LNTMSDTSLP LTPPEWPEWG NPLEDEAAYD
LIASYAAYEQ VAPRPYPAIL AIGGLSDPRV TYWEPAKWIA RLRAHTTSSR PLLLRINMEA
GHGGASGRFD ALKEAALIQA FAIWAVDTTD HTRTE