Gene TM1040_1802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1802 
Symbol 
ID4076948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1893699 
End bp1894817 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content61% 
IMG OID638007117 
Producthypothetical protein 
Protein accessionYP_613797 
Protein GI99081643 
COG category[L] Replication, recombination and repair 
COG ID[COG4335] DNA alkylation repair enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.462091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0113105 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGT CGTTTTCGCT GAAGGATCAG CTTTTTAACG TGGAAAAGAC CCGCTATCTG 
GCGGGGCTTT TTGACGCGGC TTCGGTGGAG TTTGACCCGC GCGCCTTTGA GGCGGATGTG
ATGGCGCGGC TCCTTGATCT TGAGCTCAAG GCGCGCATCA ACTGGATCGC CGAGATGCTG
TCAAAGCATG TCCCGGGTCC GCTCGATCAA GTTGCACCGG TGATTTTTGC GGCTCTGCCG
CCACCTTTGG ACCCAAGCCT GCGCGATGAT GACTTTGGCG ATTTCATCTT TGCTCCGCTT
GGGGAATGGG TTGCGGATCT CACCAAGACC GAGGCCGATC TGCCTCTGGC GCTGGATCTA
CTGGAGGCCG TAACACAGCG TTTTTCCATG GAGTTTGCGA TCCGCCCGCT GTTGAAAACC
TGGCCCGATC CTGTGCTTGC GCGCATGTCG CGTTGGGCCG GGCACGCGCA TTATCATGTG
CGAAGGCTTG CCAGTGAGGG CACGCGCCCG CGGCTGCCCT GGGGCCTTGC GGTAAATCTG
CCGTTGGATG CGCCCTTGCC GATCCTAGAC CGTCTTCACG GCGATGACAC ACGTTTTGTC
ACACGTTCGG TGGCCAATCA CTTGAATGAC ATCGCCAAAA AAGACCCGCA GATCGTGGTG
GACCAACTGA CCGCGTGGCA AGCGCGCGGC GAGCAGGCCC AAAAAGAGCT GGACTGGATG
ACCTCCCATG CGCTGCGTGG CCTGATCAAG GCGGGCGATC CCCGCGCGCT GCGACTGCTT
GGGTATGATC CGGAACTTGA TCTCTCGGCA GAGCTTGAAC TGCCCGGACG CGTGCGGATC
GGTGAAAAAC TGATGTTGGG CGCGCGGCTG CAGGGGGGCA GGGGCGCGCG GGTGCTGGTG
GATTATGCCC TGACCTTTCA ACGGGCTGGT GGCAAGACCT CAACCAAGGT GTTCAAGTGG
AAGACCGGCA CGCTGGGCGC TGACGGCCTG AGCTTGCAAA AGACGCATCC GCTTAAGGCG
CAGGCCTCGA CCTTTACGCT GTTGCCGGGG GCGCATCGGG TGACGCTGAT GGTCAATGGC
CAGCCCCGGG CAAGCGGCGA GGTGGAGTTC CTTGCCTGA
 
Protein sequence
MAESFSLKDQ LFNVEKTRYL AGLFDAASVE FDPRAFEADV MARLLDLELK ARINWIAEML 
SKHVPGPLDQ VAPVIFAALP PPLDPSLRDD DFGDFIFAPL GEWVADLTKT EADLPLALDL
LEAVTQRFSM EFAIRPLLKT WPDPVLARMS RWAGHAHYHV RRLASEGTRP RLPWGLAVNL
PLDAPLPILD RLHGDDTRFV TRSVANHLND IAKKDPQIVV DQLTAWQARG EQAQKELDWM
TSHALRGLIK AGDPRALRLL GYDPELDLSA ELELPGRVRI GEKLMLGARL QGGRGARVLV
DYALTFQRAG GKTSTKVFKW KTGTLGADGL SLQKTHPLKA QASTFTLLPG AHRVTLMVNG
QPRASGEVEF LA