Gene Gdia_1817 details


Gene Information

Locus tag: Gdia_1817
Symbol:
ID: 6975239
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2013769
End bp: 2015448
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 71%
IMG OID: 643391342
Product: peptidase M28
Protein accession: YP_002276192
Protein GI: 209543963
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
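
The coordinate and length fields above can be cross-checked against each other. The following Python sketch is only an illustration: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon, neither of which the record states explicitly. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2013769
end_bp = 2015448
gene_length_bp = 1680
protein_length_aa = 559

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# so the protein has one residue fewer than the codon count.
codon_count = gene_length_bp // 3

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop matches protein length:", codon_count - 1 == protein_length_aa)
```

Under these assumptions all three checks agree with the values listed in the record.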


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.157463
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCCCC ATCGTCCGCG CGCCGCCCTT GTATCCGCCA TGCTTCTGGC GGCGGCGTCC 
CCCGCGCATG CCGCGTCCCC GTGGGCGCCG ATCGACCCCG GCCGCATGTC GGCCACCATC
CGCACGCTGG CGTCCGACGC GTTCGCCGGG CGCGCGCCCG CCACGGCCGG TGAGGCGAAG
ACCGTGGACT GGCTGATCGC CCAGTATCGC GACATCGGGC TGGAACCCGG TGGCGAGAAT
GGCGGCTGGA CGCAGAGCGT ACCGCTGCTG CGGACCCGGA TCGGCACCCC GGCCCGGCTG
GACGCCACCA TCAACGGGGC GCCGATGGCG CTGGAGCTGA AGAAGGACAT CTACCTGACC
ACCCTGTCGC CGGTCACACG CATCAGGGTG GACGCGGCGC CGATGGTCTT CGTGGGGTAC
GGCGTGAACG CGCCCGAACG CCATTGGGAC GATTACAAGG GCGTGGACCT GAAGGGAAAA
GTTGCCGTCT TCCTCATCAA CGATCCGGAT TTCGACGCCA GACCGGGCGA GGCGGTGGCC
GGACGGTTCG GCGGCCGGAC GATGACCTAT TACGGCCGCT GGACCTACAA ATACGAGGAA
GCGGCCCGAC GCGGCGCCAT CGCCGCCCTG ATCGTGCATG ACACGCCGGG CGCGTCCTAT
CCATGGACCA CGGTCATCGC GCCGGGCGGC GAGGCCTTCG ACATCGTGCG GCAGGGCGAT
GCGAACAAGC CGGTGCCGCT GCAAGGCTGG CTGGAGGGCG ACGCCGCGCA CCGCCTGTTC
GCCCGCGCGG GGCTGGACCT TGCGGCGCTG CGCGTGAAGG CGCGCGACCC GGATTTCCAT
CCGGTCACGC TGCCCGGTAC GACCCTGACG GCAGACCTGC CGGTCGAAAC CGCGACATTG
CAGAGCCGCA ACGTGATCGG CAAGCTGACC GGCGCCCGCC ATCCCGACGA GACGGTCATG
TACGGGGCCC ACTGGGACGC ATTCGGCGTC GGCACGGACG CACAGGGCCG GCAGGTGATC
CGGCACGGCG CCGTGGATGA CGGATCGGGA ATTGCCGCGA TCCTGGAAAT TGCCCGCGCG
TTCAAGGCCG GGCATCGGCC GGACCGGACG GTCCTGTTCG CCGCCTGGAC CGCCGAGGAA
CGCGGGCTGC TGGGTTCGAC GTGGTATGCC GCCCACCCGC TGGCACCACT GGCCAGGACA
GCGGCGAACT TCACCATCGA CGTCCTGCAG ACCGCCGGCC CGGCCCATAA TGCCTTCATC
ATCGGCGCGG GACAGGACAC GCTGCAGGAC GACCTGACGG AAGCCGCCCG CGCGCAGGGA
CGCGTCACGC AGCCCGAGGC CAGGCCCGAA CGCGGTGCCT TCTACCGCGC CGACCACCTG
CCCTTCGCCC ATGCCGGCGT GCCCGTCGTG GCCATCATGG GCATGGCCGG CCCCTACGAC
CTGCTGTCCG GCGGCATCCC GGCCGGCGCG GCATGGCTGA AGGCCTACGC CGCCTGTTAT
CACCAGCCCT GCGACACCTG GGACCCGCAC TGGGACCTGC GCGGCGCGGC GGAAGATGCC
GCCCTGGTCT ATCAGGTCGG CCGGACCGTC GCGTTCTCGC ACACCTGGCC CCAGTGGAAA
CCCGGATCGG AATTCGCCGG CATTCGCGCG GCGAGCGCGG CCGAGCGGGG CGAGCCGTAG
 
Protein sequence
MPPHRPRAAL VSAMLLAAAS PAHAASPWAP IDPGRMSATI RTLASDAFAG RAPATAGEAK 
TVDWLIAQYR DIGLEPGGEN GGWTQSVPLL RTRIGTPARL DATINGAPMA LELKKDIYLT
TLSPVTRIRV DAAPMVFVGY GVNAPERHWD DYKGVDLKGK VAVFLINDPD FDARPGEAVA
GRFGGRTMTY YGRWTYKYEE AARRGAIAAL IVHDTPGASY PWTTVIAPGG EAFDIVRQGD
ANKPVPLQGW LEGDAAHRLF ARAGLDLAAL RVKARDPDFH PVTLPGTTLT ADLPVETATL
QSRNVIGKLT GARHPDETVM YGAHWDAFGV GTDAQGRQVI RHGAVDDGSG IAAILEIARA
FKAGHRPDRT VLFAAWTAEE RGLLGSTWYA AHPLAPLART AANFTIDVLQ TAGPAHNAFI
IGAGQDTLQD DLTEAARAQG RVTQPEARPE RGAFYRADHL PFAHAGVPVV AIMGMAGPYD
LLSGGIPAGA AWLKAYAACY HQPCDTWDPH WDLRGAAEDA ALVYQVGRTV AFSHTWPQWK
PGSEFAGIRA ASAAERGEP
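
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The sketch below is a minimal illustration, not the method the database used. It builds the codon table in the NCBI base order (T, C, A, G); the amino acid assignments for translation table 11 are the same as the standard code. It does not handle the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG. The function names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G), paired with the table 11 amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence listed above.
first_row = "ATGCCCCCCC ATCGTCCGCG CGCCGCCCTT GTATCCGCCA TGCTTCTGGC GGCGGCGTCC"
print(translate(first_row))
```

Passing the full gene sequence to `translate` and `gc_percent` should allow comparison with the Protein sequence and GC content fields of this record.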