Gene TM1040_1548 details


Gene Information

Locus tag: TM1040_1548
Symbol:
ID: 4075846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1654635
End bp: 1656215
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 60%
IMG OID: 638006861
Product: 5'-nucleotidase-like
Protein accession: YP_613543
Protein GI: 99081389
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID:
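
The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS whose length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1654635, 1656215)
print(length, protein_length(length))
```

With the Start bp and End bp values from this record, the sketch reproduces the listed Gene Length (1581 bp) and Protein Length (526 aa).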


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00999704
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.364034
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTCAC GTTTTCTGAC GTCGGCAGCG GCGCTGGGGC TTACGGCTGG CATGGCCGCG 
GCCGAGTACA AGCTGACGAT CTTGCACACC AACGACATCC ACAGCCGCAT CGAGTCGATC
AGCAAATATG ATTCCACTTG CGGCGCCGAT GACGAGGCCG AGGGCAAGTG TTTTGGCGGC
ATCGCCCGCG TCAAGACCAT GGTCGACACC AAACGTGCCG AGCTCGACGG CCAGAACGTG
CTTCTGCTGG ACGCGGGCGA CCCGTTCCAG GGCTCGCTGT TTTACACCAC CTACAAGGGG
GCAGCCGAAG CCGAGTTCAT GGAAGACATC GGCTATGACG TAATGGCGGT GGGCAACCAC
GAATTTGACG ACGGACCGGC CGGTCTGCAG CAATTTGTCG ACACTGTGTC TTTCCCGGTG
ATTTCCGGCA ACCTCGATCT GAGCTCCGAG CCGCTCCTGA AAGGCAAGGT GGGCAACCAT
GTCGTGCTTG AAGTGGGCGG CGAGAAAATC GGCATCATCT CCGCGCTGGC GACAGACACG
GTCGAGACCT CCTCGCCGGG GCCGAATGTG GTGTTTCAGG ATGAGATCGA CAGCCTGATC
GCCGACGTTG AGGCCCTGCA GGCAGAAGGC GTCAACAAGA TCATCGCGCT GACCCATGTG
GGTCTGGCCA AGGATATGGA AATCGCCGCC AAAGTGCCGG GGGTGGATCT CGTCGTGGGT
GGTCATTCGC ACACGCTTTT GTCCAACACC TCTGATCGTG CCGCGGGCGC ATATCCGACC
ATGGTGGGCG ATGTGCCAGT GGTGCAGGCC TATGCCTATA CCAAGTACCT GGGCGAGCTC
ACTGTGACCT TTGATGACGA AGGCAATGTC ATCTCCGCTG CGGGCGAGCC GATCCTGCTT
GATGCCTCTG TGACGCCGGA TGCCGACATG GTCGCGCGCA TCAAGGAGAT GGGTGCTCCC
ATCGATGAGA TGAAAACCCG CGTGGTTGCC GAGACAACCG ATGCGGTCGA AGGCTCGCGT
GATGTCTGCC GCGCTGGCGA ATGTGCCATG GGCAACCTCG TCGCGGATGC CATGCTGGCC
CGCGTCAAGG ATCAGGGTGT GAGCATTGCG ATCCAGAACG GTGGCGGTCT GCGCGCATCG
ATCGATGCGG GCGAAGTCAC CATGGGTGAA GTGCTGAGCG TCCTGCCGTT CCAGAACACG
CTCTCCACCT TTGAGGTCTC CGGCCAGACG ATGATTGAGG CCTTGGAAAA CGGCGTTGGG
CAGATCGAGG ACGGCGCAGG CCGCTTCCCG CAGGTTGCAG GGCTGAAATA TGCGTTTGAC
GCCTCCAAGG AGCCGGGCGC GCGCATTTCC GACGTGATGG TCATGGAAGG CGAGACCTGG
GTTGCGATTG ATCCGGCCAA AACCTACGGC GTTGTGTCCA ACAACTACGT GCGCAATGGC
GGCGACGGCT ACAAGATGTT CGCAGGCGAC GACAAGAACG CTTATGACTT TGGCCCCGAC
CTTGCGGATG TTGTTGCCGA ATACCTCGCC GAGGTCGGCC CCTACAGCGC CTATACCGAC
GGCCGCATCA CCAAGAAGTA A
 
Protein sequence
MISRFLTSAA ALGLTAGMAA AEYKLTILHT NDIHSRIESI SKYDSTCGAD DEAEGKCFGG 
IARVKTMVDT KRAELDGQNV LLLDAGDPFQ GSLFYTTYKG AAEAEFMEDI GYDVMAVGNH
EFDDGPAGLQ QFVDTVSFPV ISGNLDLSSE PLLKGKVGNH VVLEVGGEKI GIISALATDT
VETSSPGPNV VFQDEIDSLI ADVEALQAEG VNKIIALTHV GLAKDMEIAA KVPGVDLVVG
GHSHTLLSNT SDRAAGAYPT MVGDVPVVQA YAYTKYLGEL TVTFDDEGNV ISAAGEPILL
DASVTPDADM VARIKEMGAP IDEMKTRVVA ETTDAVEGSR DVCRAGECAM GNLVADAMLA
RVKDQGVSIA IQNGGGLRAS IDAGEVTMGE VLSVLPFQNT LSTFEVSGQT MIEALENGVG
QIEDGAGRFP QVAGLKYAFD ASKEPGARIS DVMVMEGETW VAIDPAKTYG VVSNNYVRNG
GDGYKMFAGD DKNAYDFGPD LADVVAEYLA EVGPYSAYTD GRITKK
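
The relationship between the gene and protein sequences above can be illustrated with a minimal sketch. It assumes the sequences are pasted in the 10-character-block layout shown here and that the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 20 bases of the gene sequence above; the trailing partial codon is ignored.
print(translate("ATGATTTCAC GTTTTCTGAC"))
```

Applied to the first two blocks of the gene sequence, the sketch yields MISRFL, which matches the start of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.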