Gene TM1040_2000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2000 
Symbol 
ID4077457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2103846 
End bp2105762 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content63% 
IMG OID638007315 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_613994 
Protein GI99081840 
COG category[R] General function prediction only 
COG ID[COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACCT TGAACCGCAG AAATCTGTTG AAGGGCAGCG TTGCGGCGGC CTGTTTGACG 
GTGCCCGCCG TGCGCGCAGT TGGGCAGACG AACTCGCTCT CGGAGATCGG CAAGTTGTTT
GCACCGGTGC CTCGGGCAAA GATCTTCACC GCGCGCGATA TTGTCACGCT TGACCCTTCT
CAACCCTCGG CCGAGGCCAT CGCGGTTGTC GGCACGCGAA TCCTCGCGGT GGGCGCGCTC
AGCGAGGTGC AGGAGATGCT CGGGGATCAA CCCTTTGATG TGGATGACAG TTTCGCCGAC
AAGGTCATCG TGCCAGGGTT TATCAGCCAG CATGATCATC CGGTTCTGGC GGCGCTGTCG
ATGTCGTCCG AGATCCTGTC GATCGAGGAG TGGGCCTTGC CGACGGGCAC CGTGCCTGCG
GTCAAGGACA AGGCGGACTT CATGAAACGC CTCACGGCAG CCGTGGAGGC GCGTACCACC
CCCGGAGAGC CGGTGGTGAC CTGGGGGTAT CATCCCGCCT TTTACGGTCC TTTGACACGC
GCGGATCTCG ACGGGATCAG CACCGAGCGC CCCATTCTGG TGTGGGGTCG CTCCTGCCAC
GAGATGATCC TCAATAGCGC CGCCCTGACG GCCGGTGGGG TGACACAGGC CGCGGTGGAG
GCCTTCGACG CGGCCTCGCA GAAACAGGCC AACCTTGCGG AGGGCCATTT CTGGGAGCAG
GGGCTTTTTG CGGTTCTGCC TCATATTGCG TCTTTGGTGG CCACGCCCGA ACAGCTGCGC
GCTGGGCTCG AACTCAGCCG TGACTTCATG CACAGCAAGG GCATCACCTT TGGCAACGAG
CCCGGCGGCA TTCTTGCAAA ACCCGTGCAG GACGGAGTCA ATGCGGTGTT TTCCAGCCCG
GATATGCCGT TTCGCTGGTC GTTCATGGTC GATGCCAAGA GCATGGTCGC CAGTTACGCC
GATGACGGCG AGGTCATTGC CCGGTCAGAG GCGCTTCAAT CCTGGTACTA CGGGATGACA
AGCCTCGCGC CGCGCCAGGC CAAACTGTTT TCGGATGGGG CGATCTATTC GCAGCTCATG
CAGGTGCGCG CGCCCTATCT CGACGATCAC CACGGCGAGT GGATGATGGA GAAAGAGCTG
TTTGAGCGCG CGTTCAGGGT CTACTGGGAT GCGGGGTATC AGCTGCACAT CCATGTCAAC
GGTGATGCCG GGTTGGATCG TGTGCTGGAG ACGCTCGAGA CCAACATGCG TCGCAATCCG
CGTTTTGATC ACCGTACGGT CATCGTGCAT TTTGCCGTCA GCGCCTTTGA CCAGGTGGAG
CGGATCAAGG CGCTCGGGGC CATCGTGAGC GGCAATCCCT ATTATGTCAC GGCGCTTGCG
GATCAGTATT CCGAAGTGGG TCTTGGCGCC GAGCGGGCAG ACAGCATGGT GCGTCTCGGG
GATCTCTCGC GGGCCGGGGT GCGTTGGTCG CTCCATTCGG ATATGCCGAT GGCGCCAGCT
GACCCGTTGT TCCTGATGTG GTGCGCGGTG AACCGGGTGA CCACGTCGGG CCGGGTGGCG
GCGCCGGAAC AGGCGGTGTC CGCCGAGGAC GCGCTGCGCG GCGTCACCAT CGAAGCGGCC
TATTCGCTGC AGATGGAAGA AGAAATCGGC AGCCTCGTGC GTGGCAAGCG GGCGAATATG
ACGATCCTTG CCGAGAACCC GCTCGAGGTG GATCCCATGG CGATCCGCGA GATCGAGGTC
TGGGGCACGG TGATGGAGGG GCGGGTGCTG CCGGTCCGTT CGAGTGATCG AGCCGCAGCA
ACGCAGCAGG ACGCCTCGGC TGGTCCGCGT GAGGCGGTTT CACCAAATGG AGCGCCAGCC
TTTGACAAAG CTGCCCTTGA GCATGCCCTG AAGGTCACGC ACGCGCATCA TCTCTGA
 
Protein sequence
MRTLNRRNLL KGSVAAACLT VPAVRAVGQT NSLSEIGKLF APVPRAKIFT ARDIVTLDPS 
QPSAEAIAVV GTRILAVGAL SEVQEMLGDQ PFDVDDSFAD KVIVPGFISQ HDHPVLAALS
MSSEILSIEE WALPTGTVPA VKDKADFMKR LTAAVEARTT PGEPVVTWGY HPAFYGPLTR
ADLDGISTER PILVWGRSCH EMILNSAALT AGGVTQAAVE AFDAASQKQA NLAEGHFWEQ
GLFAVLPHIA SLVATPEQLR AGLELSRDFM HSKGITFGNE PGGILAKPVQ DGVNAVFSSP
DMPFRWSFMV DAKSMVASYA DDGEVIARSE ALQSWYYGMT SLAPRQAKLF SDGAIYSQLM
QVRAPYLDDH HGEWMMEKEL FERAFRVYWD AGYQLHIHVN GDAGLDRVLE TLETNMRRNP
RFDHRTVIVH FAVSAFDQVE RIKALGAIVS GNPYYVTALA DQYSEVGLGA ERADSMVRLG
DLSRAGVRWS LHSDMPMAPA DPLFLMWCAV NRVTTSGRVA APEQAVSAED ALRGVTIEAA
YSLQMEEEIG SLVRGKRANM TILAENPLEV DPMAIREIEV WGTVMEGRVL PVRSSDRAAA
TQQDASAGPR EAVSPNGAPA FDKAALEHAL KVTHAHHL