Gene Gdia_0252 details

Gene Information

Locus tag: Gdia_0252
Symbol:
ID: 6973644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 276874
End bp: 278319
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 70%
IMG OID: 643389783
Product: Leucyl aminopeptidase
Protein accession: YP_002274664
Protein GI: 209542435
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
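
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 276874, 278319
gene_len = gene_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1446 bp) and Protein Length (481 aa).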


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.749602
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTATCG AGGAATTTCC CTGCCTGTTG CCCCCGACGC GGGGCCGCCG CGCCGGCGTG 
CGGACGATTC ACGCCCTGCC GCAGGCCGAA CTGGGCACCC TGGGCGAGCG AATCGGGGCA
ACGGGCGCCG CCTTCGCGCG GGATACCGGC TTTCAGGCCC GTCCCGGAGA ACTGGCGCTG
ATTCCCGGTG CCAACGGCGT GGCGGCGGCG GTGCTGGGTG TGCGCACGGG GGCGGACGGC
GGGGCGGCCG ATCCGTTCCA GTTCGGCGGT CTGGCCGGGT CGCTGCCGCC CGGGGCGTGG
AAGATCGCGC TGCCCGATGG TGTGGCGCCC GCGACGGCGG TCCTGGGCTT CTGCCTGGGG
GCCTATCGCA TGCCGGCCTT CGGACGGGCT GAGGCGTCGC CGCCGGGCAA GGACATCGCC
CGCCTGATCG TGCCGGCGGG CGGCGGGGCA GGGGCCGAAG TGGCCCATGC GATCCGGCTG
GGCAGGGACC TGATCAACAC GCCGCCCAAC CTGATGGGCC CGGCCGAACT GGCCCGCGCC
GCCCGCCATA TGCTGAATCC GCTGGGCGCG CGGGTCGAAA CGGTCAAGGG GCGTGACCTG
GCCCGCGCCT ATCCCACCAT CGCGCATGTC GGCGCCGGAT CGGCCCGGGC ACCGAAGGTC
GTCATCGCGC GTTGGCAGGG CAGCGCCGCC GGTCCCGATG CCCCCCTGCT GTCGCTGGTG
GGCAAGGGCG TGTGCTTCGA TACCGGCGGG TACGACCTGA AGCAGCCATC GGGGATGCTG
CGCATGAAGA AGGACATGGG CGGTGCGGCG GTGATGCTGT CGCTGGCGCA CCTGATCCTG
ACCCGCGACC TGCCGATCCG CCTGGAACTG CGGCTGGGCT GCGTGGAAAA CAGCGTGTCG
GGCGAGGCGA TGCGGCCCTC GGACGTGGTC GTGACCCGCT CGGGCCTGAC GGTGGAAATC
GGTAACACCG ATGCCGAAGG GCGCCTGGTC CTGTGCGACC TGCTGACCGA TGCCTGTGCC
GCCGCCCCGG ACCTGCTGAT CGACGCGGCC ACCCTGACCG GCGCCGCGCG GGTGGCGCTG
GGTCCGGACC TGCCGGCCCT GTTCAGCCCC GACGACGCCG TCGCCCAGGC GATCCTGGAA
GCCGGCACGG CCCAGTGCGA TCCGCTGTGG CGGCTGCCGC TATGGGACGG ATACGCCGAC
TGGCTGCGGA GTCCGGTGGC CGACCTGAAC AACGTATCGG CCAAGCCGAT GGCCGGATCC
GTCACGGCGG CGCTTTTTTT GCGAAATTTT GTCAAAACAG ACGTGCGCTG GGCACATATC
GACCTCTACG GCTGGAACGA TCATTACAAA CCCGGACGGC CCGAAGGAGG GGAAACGCCT
ATTTTGCGGG CAGTTTATGC CTCATTGTTG CGAATCCTTA ATGTCGCGGA CAGGGTGTCA
CACTAA
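
The gene sequence above is printed in space-separated blocks of 10 bases. As a rough sketch of how such text could be turned into a contiguous string and its GC fraction computed, the helpers below strip the whitespace and count G and C bases. The function names are hypothetical, and only the first line of the sequence is used as sample input, so the result is for that line alone and is not a recomputation of the record's GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTCTATCG AGGAATTTCC CTGCCTGTTG CCCCCGACGC GGGGCCGCCG CGCCGGCGTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full multi-line sequence to `clean_sequence` works the same way, since `str.split()` with no arguments splits on any run of whitespace, including line breaks.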
 
Protein sequence
MSIEEFPCLL PPTRGRRAGV RTIHALPQAE LGTLGERIGA TGAAFARDTG FQARPGELAL 
IPGANGVAAA VLGVRTGADG GAADPFQFGG LAGSLPPGAW KIALPDGVAP ATAVLGFCLG
AYRMPAFGRA EASPPGKDIA RLIVPAGGGA GAEVAHAIRL GRDLINTPPN LMGPAELARA
ARHMLNPLGA RVETVKGRDL ARAYPTIAHV GAGSARAPKV VIARWQGSAA GPDAPLLSLV
GKGVCFDTGG YDLKQPSGML RMKKDMGGAA VMLSLAHLIL TRDLPIRLEL RLGCVENSVS
GEAMRPSDVV VTRSGLTVEI GNTDAEGRLV LCDLLTDACA AAPDLLIDAA TLTGAARVAL
GPDLPALFSP DDAVAQAILE AGTAQCDPLW RLPLWDGYAD WLRSPVADLN NVSAKPMAGS
VTAALFLRNF VKTDVRWAHI DLYGWNDHYK PGRPEGGETP ILRAVYASLL RILNVADRVS
H