Gene TM1040_3597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3597 
Symbol 
ID4075024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp645205 
End bp646596 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content61% 
IMG OID638005116 
Producthypothetical protein 
Protein accessionYP_611826 
Protein GI99078568 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCG AATCCGCTCT TTCCCGCGCC AGCCGCTATT TTGAGGCAGG AGATTTCCAG 
TCCGATCTGG CGCATCTTGT CACCTATCGA AGCGAAAGCC AGAACACCGC GCCCGAGGCG
CGCATGGAAT GTCGGCGCTA TCTCGAAGAG GCAATGCTGC CGCGGCTGCG TGCGCTCGGA
TTTGATTGCG AGATCATCGA CAATCCCGAC CCGACCGGCG GTCCCCTGCT GATTGGTGAG
CGGCGCGAAG GCGAGGCACT CCCCACCATA CTGACCTACG GGCACGGTGA CGCAGTGCTG
GGACAAGAAG GACGCTGGCG CGAGGGGCTA GAGCCCTGGG TGCTCGTCGA AGAAGGCGAT
CGCCTCTATG GTCGCGGCAC CGCCGACAAC AAGGGTCAAC ACCTGATCAA CATCGCAGCG
CTCGAAGCCG TTCTTGCAGA ACGCGGCCAC CTCGGCTTCA ACACGCGCAT TGTCATTGAA
ATGAGCGAAG AAGTTGGTTC AGTCGGCCTG CCCGACGTGT TCAGAGCCTA CAAGGACCGG
CTCACAGCAG ATGTTCTCAT CGCCTCTGAT GGCCCCCGGC TGCAGCCCGA CGTGCCAACC
ATGTTCATGG GTTCGCGCGG GGGCACGACA TTTGATCTTG TGGTTGAAAC GCATGAGGGT
GCGCATCATT CGGGCAATTG GGGCGGGCTT TTGTCGGACC CGGCCATGAT CCTCGCACAT
GCGCTGGCCT GTATCACCGA TGTGCGCGGC CAGATCAAAG TGCCCGAATG GCGCCCGGAT
AGTCTTACCG AGAATGTGCG CATGGCGCTT CGAGACCTCC CTGTCGCGGG AGGACAGGGG
CCAGCGGTGA ACCCCGACTG GGGCGAAGAA GACCTGACCC CGGCAGAGCG CGTCTTTGGC
TGGAACAGCT TTACGGTTCT GGCGATGGTT TCGGGTGTGC CAGAAGCGCC TGTCAATGCG
ATCTCGGGTT GGGCGCGCGC GACGTGTCAG TTGCGATACG TTGTCGGCAC CGACCCGGAG
GACGTGGTGC CCGCATTGCG GCGCCATTTG GACGCGCATG GCTTCGAGAG CGTCGAAATC
CGCTGCCACG AACGAGGCTT TTTTGCCGCA ACCCGTCTGG ACCCCGATCA CCCTTGGGCG
CAGTTCGTTG GAGAGTCGAT CCGCAGGACT TCTGGTGCGC TGCATGTGCT TCCAAACCTT
GCAGGCTCTT TGCCAAATGA CAGCTTCACC GACATCCTGG AGGTGCCGAC AATTTGGGTG
CCTCATTCCT ACAGAGGCTG TTCGCAGCAT GCGCCAAACG AACACGTATT GAAATCTGTA
TACCACGACG CGTTGAGAGT GATGGCCGGA GTCTTCTGGG ACCTTGGCGA ACAGGGCGGA
CCACTCGCCT GA
 
Protein sequence
MSRESALSRA SRYFEAGDFQ SDLAHLVTYR SESQNTAPEA RMECRRYLEE AMLPRLRALG 
FDCEIIDNPD PTGGPLLIGE RREGEALPTI LTYGHGDAVL GQEGRWREGL EPWVLVEEGD
RLYGRGTADN KGQHLINIAA LEAVLAERGH LGFNTRIVIE MSEEVGSVGL PDVFRAYKDR
LTADVLIASD GPRLQPDVPT MFMGSRGGTT FDLVVETHEG AHHSGNWGGL LSDPAMILAH
ALACITDVRG QIKVPEWRPD SLTENVRMAL RDLPVAGGQG PAVNPDWGEE DLTPAERVFG
WNSFTVLAMV SGVPEAPVNA ISGWARATCQ LRYVVGTDPE DVVPALRRHL DAHGFESVEI
RCHERGFFAA TRLDPDHPWA QFVGESIRRT SGALHVLPNL AGSLPNDSFT DILEVPTIWV
PHSYRGCSQH APNEHVLKSV YHDALRVMAG VFWDLGEQGG PLA