Gene Gdia_1647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1647 
Symbol 
ID6975063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1835909 
End bp1837699 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content72% 
IMG OID643391182 
Productpeptidase S41 
Protein accessionYP_002276039 
Protein GI209543810 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.754893 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCGC CATCCGTTCC CGTTCCGGCG GGCCCCGTTT CGGCGGGCTG GATCCCCGCC 
CCGCTCCGTG GGGCCGGGCG ACGGGTCACG TCCGTGATCC TGCCGACGCG CACGATCGCC
ATCATGCTGA TCGTCATCCA CCTGATGACG CCCGTCCTGG CCCCCGTGGC GGCGCGCGCC
GCCGGCACGC CCGCGCCACC CCCGCCCCCC CGGCCCGCGC CGACCCCCGG ACAGGATACG
AGGAGCCAGG ATACGAGCCA GATTCTGGTC CAGGCGGACC CGGTGCCCCA GGCGGGCCAG
TTGGACGCGG ACATGACGAT ATCGGTCGTG AACGCGGCCC TGACCTTCCT GCTGCCCCGA
ACGCTGGAGA GCCACACGCC GCGCGATTTC TGCCTGTGGG GCCTGAACGG GCTGAGCGCG
ATCGACCCGT CCCTGACCGT CGTCGAGCAG AAGGGGCCCG AACAGAAAGG GGCGAACCAG
AACGGGATCA TCCAGCTTTC GCTGGGGCAG GAAATCGTGC TGCGCCTGCC CGCCCCCCCC
GATTCCGACC AGGCGGGATG GACGGACCTG ACCGTGCGGT TGATGCAGGC CGCCTGGGCC
CGGTCGGGCA CGGTGCGCGG CGCCGGCGCG GACGGTCTGA TGCAAAGCTT CTTCGACGAA
CTGTTCAACC ACATGGACCC GTATTCGCGT TACGTCGCGC CCAGCCCGGC CACGACCGAC
CGCGACACGC GCACCGGCGG CGAGGCCGGG ACCGGGCTGA CATTGGGCCG CGACGCACGC
TCGATCCTGA TCACCGGGGT CAATGCCAAC GGGCCGGCCT GGCCGGCGGG TCTGGCGACG
GGCCAGCGGC TGTACGCGGT CAATGGCCGT TCGACCCGTG ACCAGGCGCC CGGAACGGTC
GCCCAGTGGC TGCTGGGTGC GCCCGGCAGC ACGGTGACGG TGACGGTTGG CGACGGGCGC
GCCCGGCGCA CCGTGACGCT GCGCCGGGCC TCGGTCCCGC CGGAGACGGT CTTCGCCTAT
GCCGCGGAAC ATATGGTGGT GATCCGCGTC ACCGCGTTTT CGGCCGACAC CGCGCAGGAA
ATGAGCCAGT ACCTGGACCA GGCATCCGAC GACCAGCACC TGCGCGGACT GGTGCTGGAC
CTGCGCGGCA ATCGCGGCGG GGTGCTGCAG CAGGCGGTGA CGGCCAGCGC GCTGGTGCTG
GACCAGGGCG TGGCGGCGAT CACCCACGGG CGGGACCCGG AGGCCAACCA TGTCTGGGCG
GTGCAGGGCG GCGACATGAC CGGGGGCGTG CCGATCGTGG TGCTGGTGGA CGGGCGGACC
GCCAGCGCGG CCGAGATCCT GGCCGCCTCG CTGGCCGACC ACCGGCGCGC CGTGGTGGTG
GGCAGCGCCA CGCTGGGCAA GGGACTGGTG CAGACGATCG GCCAGATGCC CGACGGCGGT
GAATTGTTCG TGACCTGGAG CCGCGTCCTG GCGCCGCTGG GCTGGCCGCT GCAGGGGCTG
GGCGTCATGC CGCAGGTCTG CACCAGCCGG GGCGAAAGCG ACCTGGAACG GCAGTTGCAG
GACCTGGCGG CCGGCCAGGT GGACATGCGC GACGCCGTCC AGGCGACACG CGCCACGCGC
TATCCCGTGC CGGTGTCGCG CATCCTGGAC CTGCGCCGCG CCTGCCCGGC GGCGATCGGC
ACCGATTCCG ACCTGGATGC CGCGCGGTCG CTGATCGACA ATCCCGCCGA ATATCGCGCC
GCCCTGTCCG CCATCCCCGA GGAGAGCCCC TATGCGCCGC AGGCTGAATA A
 
Protein sequence
MRAPSVPVPA GPVSAGWIPA PLRGAGRRVT SVILPTRTIA IMLIVIHLMT PVLAPVAARA 
AGTPAPPPPP RPAPTPGQDT RSQDTSQILV QADPVPQAGQ LDADMTISVV NAALTFLLPR
TLESHTPRDF CLWGLNGLSA IDPSLTVVEQ KGPEQKGANQ NGIIQLSLGQ EIVLRLPAPP
DSDQAGWTDL TVRLMQAAWA RSGTVRGAGA DGLMQSFFDE LFNHMDPYSR YVAPSPATTD
RDTRTGGEAG TGLTLGRDAR SILITGVNAN GPAWPAGLAT GQRLYAVNGR STRDQAPGTV
AQWLLGAPGS TVTVTVGDGR ARRTVTLRRA SVPPETVFAY AAEHMVVIRV TAFSADTAQE
MSQYLDQASD DQHLRGLVLD LRGNRGGVLQ QAVTASALVL DQGVAAITHG RDPEANHVWA
VQGGDMTGGV PIVVLVDGRT ASAAEILAAS LADHRRAVVV GSATLGKGLV QTIGQMPDGG
ELFVTWSRVL APLGWPLQGL GVMPQVCTSR GESDLERQLQ DLAAGQVDMR DAVQATRATR
YPVPVSRILD LRRACPAAIG TDSDLDAARS LIDNPAEYRA ALSAIPEESP YAPQAE