Gene TM1040_2829 details

Gene Information

Locus tag: TM1040_2829
Symbol:
ID: 4076648
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2998674
End bp: 2999771
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 60%
IMG OID: 638008158
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_614823
Protein GI: 99082669
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
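
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2998674
END_BP = 2999771


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1098
print(protein_length(length))  # 365
```

Both results agree with the Gene Length and Protein Length fields of the record.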


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCAGA CGCTCACACT TCTTGGACTG GAAAGCAGTT GCGACGACAC AGCAGCCGCC 
GTTGTACGAC AGACGACAGG CGCCAAGGCG GAAATCCTGT CGTCGATCGT GTTCGGCCAG
ACCGAGCTTC ACAGCGCGTA TGGCGGCGTG GTGCCCGAGA TCGCCGCGCG CGCCCATGCT
GAAAAACTCG ACAGTTGCGT GCGCGACGCT CTCGCCGAGG CCGGCCTCAC CCTAGGCGAC
CTGGATGCCA TCGCCGTCAC CGCAGGCCCA GGGCTGATCG GGGGCGTGAT GTCTGGCGTG
ATGTGCGCAA AAGGGATCTC TGCTGCCACC GGACTGCCAC TGATCGGAGT GAATCACCTT
GCCGGGCATG CTCTGACGCC GCGCCTGACT GACGATATCA CCTATCCCTA CTTGATGCTG
CTGGTGTCAG GTGGCCATTG CCAATATCTG ATCGCACGCG GGCCAGAGAC GTTCTCGCGC
CTTGGCGGCA CAATAGACGA TGCACCCGGC GAAGCCTTTG ACAAGACCGC GCGACTTCTT
GGGTTGCCAC AACCCGGAGG ACCGTCTGTA CAGGCGGAGG CAGAGCATGG CGATCCGGAG
CGGTTCCGCT TTCCACGCCC TCTGCTTGAT CGCCCTGATT GCAACCTGTC TTTTTCGGGG
TTGAAAACCG CCTTGATGCG AATGCGGGAC CAGATCATCG CAGAAAAGGG AGGCCTGACA
CGTCAGGATC GCGCCGATCT CTGTGCTGGT TTTCAGGCGG CAATCGTTGA CACACTTGTG
GAAAAAACCC GTCGCGCCCT CCGGCTCTAT CTCGAGGATA AGCCCCAGCA CCCAACGTTG
GCCGTGGCCG GAGGTGTCGC TGCCAATACT GAAATCCGGA ACGGATTGAT GGCTCTATGC
TTTGAGTTAG AGACTGATTT TCTCGCACCC CCTTTGGCGC TTTGCACCGA TAACGCAGCG
ATGATCGCCT ATGCAGGACT GGAGCGCTAC AAAACCGGAG CGCGTGATGG CATGTCGCTT
TCGGCACGTC CGCGCTGGCC GCTGGATAAA ACCAGCCCAG CGCTGATCGG CAGCGGCAAG
AAAGGGGCCA AGGCATGA
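
The GC content field can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted as plain text in the blocked layout shown (groups of 10 bases separated by spaces and line breaks); `clean` and `gc_content` are hypothetical helper names.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the blocked layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for b in bases if b in "GC") / len(bases)


# First line of the gene sequence, used only as sample input.
first_line = "ATGACTCAGA CGCTCACACT TCTTGGACTG GAAAGCAGTT GCGACGACAC AGCAGCCGCC"
print(len(clean(first_line)))            # 60
print(round(gc_content(first_line), 1))
```

Passing the full 1098 bp sequence instead of the sample line gives the value to compare against the GC content field; a single line is not representative of the whole gene.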
 
Protein sequence
MTQTLTLLGL ESSCDDTAAA VVRQTTGAKA EILSSIVFGQ TELHSAYGGV VPEIAARAHA 
EKLDSCVRDA LAEAGLTLGD LDAIAVTAGP GLIGGVMSGV MCAKGISAAT GLPLIGVNHL
AGHALTPRLT DDITYPYLML LVSGGHCQYL IARGPETFSR LGGTIDDAPG EAFDKTARLL
GLPQPGGPSV QAEAEHGDPE RFRFPRPLLD RPDCNLSFSG LKTALMRMRD QIIAEKGGLT
RQDRADLCAG FQAAIVDTLV EKTRRALRLY LEDKPQHPTL AVAGGVAANT EIRNGLMALC
FELETDFLAP PLALCTDNAA MIAYAGLERY KTGARDGMSL SARPRWPLDK TSPALIGSGK
KGAKA
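
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in which start codons it permits; that is not modeled here since this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence in this record.
print(translate("ATGACTCAGA CGCTCACACT TCTTGGACTG GAAAGCAGTT GCGACGACAC AGCAGCCGCC"))
# MTQTLTLLGLESSCDDTAAA
print(translate("AAAGGGGCCA AGGCATGA"))
# KGAKA
```

The two outputs match the first 20 and last 5 residues of the protein sequence above; the last gene line is in frame because the 1080 bases before it are a multiple of 3.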