Gene Gdia_0213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0213 
Symbol 
ID6973605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp229880 
End bp231670 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content70% 
IMG OID643389744 
Productpeptidase M24 
Protein accessionYP_002274625 
Protein GI209542396 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.22897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0833498 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGA TCGCTTCCCG CCTCCCTGCC CTGCGGACCG TGCTGGGGCA GATGGACGTG 
GATGGTTTCA TCCTGCTGCG CGGTGACGAG CATCTGGGGG AATATGTCGC ACCCTGTGCC
GAACGCCTGG CCTGGCTGAC CGGGTTCACC GGCAGCGCCG GCATGGCCGT GGTGCTGCGC
GACGGGCCGG CGGCGGTGTT TTCCGACGGC CGCTATGTCA CCCAGATGGA CCAGCAGGTG
GACGGCGCGG CCTGGTCGCG CCTGCATCTG CGCGACACGC CGCCGGCCCG CTGGCTGGCA
TCCCATGCCG GGGCGGGCCA GCGGATCGGC TACGATCCCC GGCTGGTCGG CGAGGCCGGG
TTGCAGCCCT TCCTCGATTG CGGGCTGACC ATGGTGCCGA TGGCGGCCAA CCCGGTGGAC
CGCATCTGGA CCGACCGGCC GGCGGCACCC GCCACGGCCT GCATGCCGCA GCCCCTGGCC
TTCGCGGGCG AGGACAGCGC CGCCAAGCGG GCACGGATGG CCGCCATCCT GAAGGCGGAC
GGCCAGGATG CCGCCGTGCT GGGCGACCCC ACCGCCATTG CCTGGTTGCT GAACGTCAGG
GGCCATGACG TTCAATACAC CCCCGTCTGC CTGGCCTTCG CCATCCTGCA TGACGATGCG
CGGGTGGACC TGTTCATCGA CCCCGCGCGC CTGCCGCAGG ATACGGCGGC GTGGCTGGGC
CCCGAGGTGA CGATCGTGGA GCCGGCGGGG CTGGAGGCGG CACTGGCGGC GTTGGCCGGA
CGGCGGGTGC GCGTCGATCC GGTCGGGACC GCCATATGGT TCATCCAGAC GCTGGAGGCG
GCCGGGGCGA CGGTGGCGCG CGGCGGCGAC CCGTGCGTGC TGCCCCGCGC CCGGAAGAAC
GATGTCGAGC AGGACGGCGC ACGGCGGGCG CATCTGCTGG ACGGGATCGC GCTCTGCCGT
TTCCTGCACT GGATGGATAC CGAGGGCGTG GGCCCGGATA GCATAAGGCC GGGAGAACTG
GACGCCGCGA ACCGGCTGGA CGCGTTCCGC GCCCTGTGCC CGGACTATCG CGAGGAAAGC
TTTCCCGCGA TTTCCGGGGC CGGCCCCAAC GGCGCGGTCA TCCATTATCG CGTGACCCCC
GAAAGCAGCC GGACGATCGG GACGGACGAG GTCTATCTGA TCGACAGCGG CGGGCAGTAT
CCGTTCGGCA CCACCGACGT CACGCGCACG ATCTGGACCG GCGCCGGCCG AGGGCCGGAG
GATGTGCGCC ACGCCTTCAC CCGCGTGCTG AAGGGGCATA TCGCCCTGGC GCGGGCCCGC
TTTCCGGTGG GCACCACCGG GCACGCGCTG GACGGGCTGG CGCGCTATGC GCTGTGGCAG
GCGGGAATGG ATTACGACCA TGGAACCGGC CACGGCATCG GCAGCTATCT GTCGGTCCAT
GAGGGACCGT GTTCGATTTC GCCCGTCTAT CGGCCCGTCG CGGTCGAGGC CGGCATGATC
CTGTCCGACG AGCCCGGATA TTACCGGCCC GGCGCCTTCG GCATCCGGCT GGAAAACCTG
CTGCTGGCCC GCCCGGCACC GGCCGAGCCC AACCGGTCGT TCCTGGAGTT CGAGACGCTG
ACGCTGGCGC CGTTCGACCG GCGGCTGATC GACGCGTCCC TGCTGACGGC GGAGGAAACC
GCATGGATCG ATGCGTACCA TGCACGGGTT TGTGAAACGC TTGCCCCGCA TCTGGAGGCT
GCACCCACGG CATGGCTGCA TGCCGCATGT GCCCCGATCG GCGCGGAATA G
 
Protein sequence
MTAIASRLPA LRTVLGQMDV DGFILLRGDE HLGEYVAPCA ERLAWLTGFT GSAGMAVVLR 
DGPAAVFSDG RYVTQMDQQV DGAAWSRLHL RDTPPARWLA SHAGAGQRIG YDPRLVGEAG
LQPFLDCGLT MVPMAANPVD RIWTDRPAAP ATACMPQPLA FAGEDSAAKR ARMAAILKAD
GQDAAVLGDP TAIAWLLNVR GHDVQYTPVC LAFAILHDDA RVDLFIDPAR LPQDTAAWLG
PEVTIVEPAG LEAALAALAG RRVRVDPVGT AIWFIQTLEA AGATVARGGD PCVLPRARKN
DVEQDGARRA HLLDGIALCR FLHWMDTEGV GPDSIRPGEL DAANRLDAFR ALCPDYREES
FPAISGAGPN GAVIHYRVTP ESSRTIGTDE VYLIDSGGQY PFGTTDVTRT IWTGAGRGPE
DVRHAFTRVL KGHIALARAR FPVGTTGHAL DGLARYALWQ AGMDYDHGTG HGIGSYLSVH
EGPCSISPVY RPVAVEAGMI LSDEPGYYRP GAFGIRLENL LLARPAPAEP NRSFLEFETL
TLAPFDRRLI DASLLTAEET AWIDAYHARV CETLAPHLEA APTAWLHAAC APIGAE