Gene Gdia_1207 details


Gene Information

Locus tag: Gdia_1207
Symbol:
ID: 6974611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1344543
End bp: 1346474
Gene Length: 1932 bp
Protein Length: 643 aa
Translation table: 11
GC content: 66%
IMG OID: 643390736
Product: ATP-dependent metalloprotease FtsH
Protein accession: YP_002275605
Protein GI: 209543376
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
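
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one stop codon is included in the gene length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1344543, 1346474)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the 1932 bp and 643 aa listed in the record.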


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.390556
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAATT TTGGCCGGAA CCTGGCTCTC TGGGTTATCA TCATCGTGCT GCTGCTGTTG 
CTGTTCAACG TCTTCCAGCC TGGAAGCGTG CAGCATGCGT CGCAGCAACT GGCATATTCC
GACTTCATCG GGGATGTGAA TGGCGGGCGC GTGCGCTCGG TCATCGTGCA GGATCACAAT
ATCTCCGGCA CGCTGACGGA CGGCACGTCG TTCGAGACCT ACACTCCCCA GGACCCGACC
CTGATCCCGC GCCTGACCGA AAAGGGCGTC GAGGTGGTCG CGAAGCCGCT GGACAGCGAT
TCCAATCCGT TCCTGCGTTA TCTGATCAAC TACGCCCCGA TCCTGCTGAT GTTCGGCGCC
TGGATTTTCA TCATGCGGCA GATGCAGGCC GGCGGCGGCC GGGCGATGGG TTTCGGCAAG
TCCCGCGCCC GCATGCTGAC GGAAAAGCAG GGCCGCGTGA CGTTCGATGA CGTGGCCGGC
ATTGACGAAG CCAAGAGCGA ACTGCAGGAA ATCGTGGACT TCCTGCGTGA CCCGCAGAAA
TTCACCCGCC TGGGCGGCAA GATCCCCAAG GGCGTGCTGC TGGTCGGCCC GCCGGGCACC
GGCAAGACCC TGCTGGCCCG CGCCATCGCG GGCGAGGCGA ACGTGCCCTT CTTCACCATC
TCCGGCTCGG ACTTCGTCGA GATGTTCGTC GGCGTCGGCG CGTCCCGCGT CCGCGACATG
TTCGAACAGG GCAAGAAGGC CGCCCCCTGC ATCATCTTCA TCGATGAAAT CGACGCCGTG
GGCCGCCATC GCGGCGCCGG CCTGGGCGGC GGCAACGACG AGCGCGAGCA GACCCTGAAC
CAGATGCTGG TCGAAATGGA CGGTTTCGAG AGCAATGAGG GCGTGATCCT GATTGCCGCG
ACCAACCGTC CCGATGTGCT GGACCCGGCC CTGCTGCGCC CCGGCCGTTT CGACCGCCAG
GTGGTGGTGC CCAACCCCGA CGTGGTGGGA CGCGAGAAGA TCCTGCGCGT GCACATGCGC
AAGGTCCCGC TGGCCTCCGA CGTCGATCCC AAGGTGATCG CGCGCGGCAC GCCGGGCTTT
TCGGGTGCCG ACCTTGCCAA CCTGGTGAAC GAGGCCGCGC TGATGGCCGC CCGGCTGGGC
AAGCGCACGG TCGCGATGCT GGAATTCGAG AACGCCAAGG ACAAGGTCCT GATGGGTGCC
GAGCGCCGCT CGCTGGTCAT GAGCGACGAC GAAAAGCGGA TGACCGCCTA TCACGAGGGC
GGGCACGCGC TGGTCGCGAT CCTGACCCCC GGCGCCGATC CGGTGCACAA GGCCACGATC
ATTCCGCGCG GCCGCGCGCT GGGCCTGGTC ATGAGCCTGC CGGAGGGCGA CCGCTATTCC
AAGAGCCGGG CGAAATGCCT GGGCGAGCTG ACGCTGGCCA TGGGCGGCCG CGCGGCCGAG
GAGATCATCT TCGGCGCCGA CAACGTGTCC AACGGCGCGT CGGGCGACAT CAAGATGGCG
ACCGACCTGG CCCGCCGCAT GGTTTCCGAA TGGGGCATGA GCGACAAGCT GGGCATGATC
GCCTATGGCG ATAACGGGCA GGAAGTCTTC CTGGGCCACA GCGTGACCCA GAACAAGAAC
GTGTCCGAGG AAACGGTCCG CGAGATCGAT GACGAGATCA AGATCCTGAT CGACAGCGCC
TATGCCCGGG CCCGGACGCT GCTGATCGAG CATGTCGACG AACTGCATCG CCTGGCCCAG
GCCCTGCTGG AGTACGAAAC CCTGTCGGGC GAGGAAATCC GCCAGGTCCT GCGCGGCGAG
CCGATCGAGC GGGTGGTGGT GGACGACCCG ATGCCGGAAA ATCGCCGCGC CTCGGTTCCG
CCCACGCCGC CGGCCGCGCC GCTGCCCTCG CCGGGCGGCG GGGGGCTGGA TCCGGCGCCG
CAGCCGGGCT AA
 
Protein sequence
MNNFGRNLAL WVIIIVLLLL LFNVFQPGSV QHASQQLAYS DFIGDVNGGR VRSVIVQDHN 
ISGTLTDGTS FETYTPQDPT LIPRLTEKGV EVVAKPLDSD SNPFLRYLIN YAPILLMFGA
WIFIMRQMQA GGGRAMGFGK SRARMLTEKQ GRVTFDDVAG IDEAKSELQE IVDFLRDPQK
FTRLGGKIPK GVLLVGPPGT GKTLLARAIA GEANVPFFTI SGSDFVEMFV GVGASRVRDM
FEQGKKAAPC IIFIDEIDAV GRHRGAGLGG GNDEREQTLN QMLVEMDGFE SNEGVILIAA
TNRPDVLDPA LLRPGRFDRQ VVVPNPDVVG REKILRVHMR KVPLASDVDP KVIARGTPGF
SGADLANLVN EAALMAARLG KRTVAMLEFE NAKDKVLMGA ERRSLVMSDD EKRMTAYHEG
GHALVAILTP GADPVHKATI IPRGRALGLV MSLPEGDRYS KSRAKCLGEL TLAMGGRAAE
EIIFGADNVS NGASGDIKMA TDLARRMVSE WGMSDKLGMI AYGDNGQEVF LGHSVTQNKN
VSEETVREID DEIKILIDSA YARARTLLIE HVDELHRLAQ ALLEYETLSG EEIRQVLRGE
PIERVVVDDP MPENRRASVP PTPPAAPLPS PGGGGLDPAP QPG
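
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the amino-acid assignments of NCBI translation table 11 (the table named in the record), strips the space-separated 10-character blocks used in this listing, and stops at the first stop codon. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the excerpts used are the first and last lines of the gene sequence shown above.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, in the usual
# TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record above.
first_line = "ATGAACAATT TTGGCCGGAA CCTGGCTCTC TGGGTTATCA TCATCGTGCT GCTGCTGTTG"
print(translate(first_line))  # MNNFGRNLALWVIIIVLLLL

# Last line of the gene sequence, ending in the TAA stop codon.
last_line = "CAGCCGGGCT AA"
print(translate(last_line))  # QPG
```

Both excerpts match the corresponding start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.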