Gene Gdia_0685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0685 
Symbol 
ID6974082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp774344 
End bp776326 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content67% 
IMG OID643390214 
Productpeptidase M61 domain protein 
Protein accessionYP_002275090 
Protein GI209542861 
COG category[R] General function prediction only 
COG ID[COG3975] Predicted protease with the C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones110 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.00148081 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCGTCCGC TTTCCAAACG CTTCCTCGCC CTGTCCCTGA CCACCGCCCT GGTGGCCGCC 
GGCATGGCGG GCCGTCCGTA TTCCGTGCGC GCGGCCGACA GCCAGCCGCA ACCCCAGCCC
CTGGCGCTTC CAGATTCCGT GCCGGTGCCG CGTGACGTGC CGTTCGGCGG AACCATTCAT
CTGACGGTCG ATGCCACCGA CCTGGCCCGG CGCGTCATGA CGGTGCGTGA AAGCGTGCCG
GTCCCTGCTG CGCTGGCGGA AACGGGCGGC GACATGACCC TGCTCTACCC GATGTGGGTG
CCGGGCGACC ATTCGCCGAC CGGCACGATC GAGCAGTTCG GCGGCCTGGT GGTCAAGGCC
GGCGTCGCCC CGCTGGAATG GGTGCGCGAC ACGGTGATGG TCAGCGCCTT CCATGTCCGG
GTGCCCAAGG GCGTGCATGC GCTGAACCTG TCGTTCCAGA TTCTGTCGCC CGTGTCGCCC
GAGGAAGGGC GCGTGGTCAT GACGCCGGAG ATGCTGAACG TCCAGTGGAA TGCGGTGGCG
CTGTACCCGG CGGGGTATTT CACGCGCCAG ATCCCGGTGC AGGCGGAACT GCGTCTGCCC
CACGGATGGC AGTACGGCAC CGCGCTGCGC GCCCATGGCC CGGCGGGCGA CGTCGTCACG
TTCGACCCGG TGCCGTTCAA CACGCTGGTG GATTCCCCGC TGTTCGCCGG GCGATATTTC
CGCCGGTTCG ACCTCGCCCC CGGGGCTGCG GCGCCGGTAA CGCTGAACGT GATGGCGGAC
AAGCCGGCGG AACTGGCGGC CACCGACGCC CAGGTCAAGG CGCACCAGGA TCTGGTGACG
CAGGCAGGGC TGCTCTTCGG CTCGCACCAT TACGACCATT ACGATTTCCT GCTGGCCCTG
ACCAATGAAC TGGGGCGGAT CGGGCTGGAA CATCACCGCT CCAGCGAGAA CAGCGGTCCG
CGCGGCTACT TCACCGAGTG GGACAAGACC TTCGCCGTCC GTGACCTGCT GGCCCACGAA
TACACCCATT CCTGGAACGG CAAGTATCGC CGCCCCGCCG ACCTGTGGGC GCCGAACTTC
AACACCCCGC AGCGCGGGTC CGGCCTGTGG GTGTATGAAG GACAGACCCA GTACTGGGGC
TATGTCCTTT CCGCCCGGTC GGGCCTGATG ACCGAACCGC AGGTGATCGA CGCGCTGGCC
GAGGTGGGCG CGATGTACGA CACCCGCCAG GGACGCACCT GGCGTCCGCT GCAGGACACG
ACCAACGATC CGGTGATCGC CCAGCGTTCG CCCCTGTCCT GGCGGAGCTG GCAGCGCAGC
GAGGATTATT ATTCCGAAGG CCAGCTGGTC TGGCTGGACG CCGATACGCT GATCCGCAAG
CTGTCGGACG GCAAGCACTC GCTGGACGAT GTCGCGCGCC ATTTCTTCGG CACCAATGAC
GGCAGCTTCG TCACCAGCAC CTACCAGTTC GCCGACGTGG TGAAGGCGCT GAACGACATC
CAGCCTTATG ACTGGGCCAC CTTCCTGCGC CAGCGGCTGG ACCGCACATC GACCCATGCG
CCGCTGGACG GCTTCACGCG CGGGGGATAC CGCCTGGTCT ATACCGACCA GCCGAACGAC
TGGGTCAAAT CCTTCGCCGC CATGCGGCAC GTGACGGATT TCAGCTTCTC GCTGGGCTTC
CTGGTCGGCA AGGGCGGTAC GCTGGCCAAC GTCGAATGGG GCAGCCCGGC CTGGCAGGCC
GGCGTGACCC GCGACGCGCA ACTGGTCGCG GTGAATGGCG AGGCCTACGA TCCCGACGTG
CTGCAGGACG CGATCACGGC GGCACGGGAC CCGCACGCGG CCCCGATCGC ACTGCTGCTG
CACATGGGGG ACCACTACGT CACGATCGCC ATCCCGTATC ACGGCGGCCT GCGCTATCCG
CATCTGCAGC GGATACCGGG CACGCCCGAC ATGCTGGCTG ACATCCTTGC ACCGCGCCAC
TGA
 
Protein sequence
MRPLSKRFLA LSLTTALVAA GMAGRPYSVR AADSQPQPQP LALPDSVPVP RDVPFGGTIH 
LTVDATDLAR RVMTVRESVP VPAALAETGG DMTLLYPMWV PGDHSPTGTI EQFGGLVVKA
GVAPLEWVRD TVMVSAFHVR VPKGVHALNL SFQILSPVSP EEGRVVMTPE MLNVQWNAVA
LYPAGYFTRQ IPVQAELRLP HGWQYGTALR AHGPAGDVVT FDPVPFNTLV DSPLFAGRYF
RRFDLAPGAA APVTLNVMAD KPAELAATDA QVKAHQDLVT QAGLLFGSHH YDHYDFLLAL
TNELGRIGLE HHRSSENSGP RGYFTEWDKT FAVRDLLAHE YTHSWNGKYR RPADLWAPNF
NTPQRGSGLW VYEGQTQYWG YVLSARSGLM TEPQVIDALA EVGAMYDTRQ GRTWRPLQDT
TNDPVIAQRS PLSWRSWQRS EDYYSEGQLV WLDADTLIRK LSDGKHSLDD VARHFFGTND
GSFVTSTYQF ADVVKALNDI QPYDWATFLR QRLDRTSTHA PLDGFTRGGY RLVYTDQPND
WVKSFAAMRH VTDFSFSLGF LVGKGGTLAN VEWGSPAWQA GVTRDAQLVA VNGEAYDPDV
LQDAITAARD PHAAPIALLL HMGDHYVTIA IPYHGGLRYP HLQRIPGTPD MLADILAPRH