Gene Gdia_0622 details


Gene Information

Locus tag: Gdia_0622
Symbol:
ID: 6974019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 697682
End bp: 698929
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 69%
IMG OID: 643390153
Product: peptidase M29 aminopeptidase II
Protein accession: YP_002275029
Protein GI: 209542800
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2309] Leucyl aminopeptidase (aminopeptidase T)
TIGRFAM ID:
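
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 697682
end_bp = 698929
gene_length_bp = 1248
protein_length_aa = 415


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)      # True
print(coding_codons(gene_length_bp) == protein_length_aa)   # True
```

Under these assumptions, 698929 - 697682 + 1 = 1248 bp, and 1248 / 3 = 416 codons, of which 415 encode amino acids.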


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.141981
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTCCG CATCGTCCCA TGACGGCCTG CTGGACCGGC TGGCCGAGGT CGCGGTCCGA 
ACCGGACTGA ACCTGGCGCC GGGCCAGCAA TTGCTGATCA CCGCGTCGCT GGACGCGGTT
CCGCTGGTGC GCCGCATCAC CGAACACGCC TATCGCGCCG GCGCGTCGCT GGTCACGCCG
TTCTTCTCGG ACGACGAGAT GACGCTGGCG CGATTCCGCC ACGCCCCGGA TGCGTCCTTC
GACGTCGCGG CCGGCTGGCT GCAGGACGGC ATGGCCAACG CCTATCGCCA GGGCGCCGCG
CGGATGGCCG TGACCGGCGG CAACCCCACC CTGCTGGCCG GGGAAGACCC GGACCGCGTC
GCCCGCGCCG GCAAGGCCAG TTCGCTGGCC GGCCGCCCGG CGATGGAACT GATCACCAAT
TTCGCCGTCA ACTGGAACAT CGTCGCCTGC GCCACCCCGG CCTGGGCCGC GCAGGTGTTC
CCCGACGATG CCCCGGACAG GGCGCTGGCC CGGCTGTGGG ACGCGATCTT CCTGGCGTCC
CGCGTCACGG TGGACGACCC CGTGGCGGCG TGGGTCGAGC ATAACGACAC GCTGCACCGC
CGCGCCGACT GGCTGAACGA GCGCCGGTTC GCGGCGCTGC AGTTCACCGG GCCGGGTACG
GACCTGACGG TGGGGCTGGC GGACGGCCAT GCCTGGGCCG GCGGGTCGGA ACCGGCGCGC
AACGGCATCG TGTGCAACCC CAATATCCCG ACCGAGGAAG TCTTCACCAC GCCGCACGCG
CGGCGGGTCG AGGGCTATGT CCGCGCGACG AAGCCCCTGT TCCACCAGGG CACGCTGATC
GACGGCATCG CGGTCCGCTT CGCCGACGGG CGCATCGTCG AAGCGCATGC GACCGAGGGG
CTGGAGGTGC TGGAACGCAT CCTGGACACC GACGAGGGCG CCCGCCGGCT GGGCGAGGTG
GCGCTGGTGC CGCATTCCTC GCCGATTTCG CAGAGCGGCG TGCTGTTTCG CAACACGCTC
TTCGACGAAA ACGCGTCCAG CCATATCGCG CTGGGCCAGG CCTACACGAA ATGCATGCTG
GATACCGAGA ACCAGACGCC CGAGCAGATC CAGGCCCGTG GCGCCAACAG CAGCTTCATC
CATATCGACT GGATGATCGG CTCGGCCGAG ATCGACGTGA CCGCCATCAC CCAGGATGGC
GCGTGCGAAC CCCTGATGAA ACATGGTGAG TGGGTCAACA AAGTATGA
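
The gene sequence is printed in space-separated groups of ten bases, so it needs to be joined before any computation. The following is a minimal sketch of how the grouped text could be cleaned and a GC fraction computed; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as example input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a small example.
first_line = "ATGACTTCCG CATCGTCCCA TGACGGCCTG CTGGACCGGC TGGCCGAGGT CGCGGTCCGA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two functions to the full sequence block gives a value that can be compared with the GC content reported for this gene.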
 
Protein sequence
MTSASSHDGL LDRLAEVAVR TGLNLAPGQQ LLITASLDAV PLVRRITEHA YRAGASLVTP 
FFSDDEMTLA RFRHAPDASF DVAAGWLQDG MANAYRQGAA RMAVTGGNPT LLAGEDPDRV
ARAGKASSLA GRPAMELITN FAVNWNIVAC ATPAWAAQVF PDDAPDRALA RLWDAIFLAS
RVTVDDPVAA WVEHNDTLHR RADWLNERRF AALQFTGPGT DLTVGLADGH AWAGGSEPAR
NGIVCNPNIP TEEVFTTPHA RRVEGYVRAT KPLFHQGTLI DGIAVRFADG RIVEAHATEG
LEVLERILDT DEGARRLGEV ALVPHSSPIS QSGVLFRNTL FDENASSHIA LGQAYTKCML
DTENQTPEQI QARGANSSFI HIDWMIGSAE IDVTAITQDG ACEPLMKHGE WVNKV
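
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a simplified illustration, not the database's own method: it builds the codon table from the standard amino-acid string in TCAG order, whose codon assignments translation table 11 shares with the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The `translate` function is a hypothetical helper.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same codon assignments as the standard code; it differs in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the grouping spaces and newlines
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGACTTCCG CATCGTCCCA TGACGGCCTG"))  # MTSASSHDGL
print(translate("AAAGTATGA"))                         # KV
```

The two outputs match the first ten and last two residues of the protein sequence listed above.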