Gene TM1040_2800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2800 
Symbol 
ID4076568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2962121 
End bp2963587 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content59% 
IMG OID638008125 
ProductAMP nucleosidase 
Protein accessionYP_614794 
Protein GI99082640 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0775] Nucleoside phosphorylase 
TIGRFAM ID[TIGR01717] AMP nucleosidase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.955099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGATG TAAAAACACC CGACCTGCCC GCTGCCGAAA GCTTTACCGA CGCCAAAGCG 
GCCGTAGCGC GCCTGATTGA GCTCTACGAT AGCGCAACCA ATTTTCTCTG TGACGAGTTT
ATTGCCGCGA TGAGCGCCGG AGCTGCGCCG AGCAGCCGCA TGCGAGCCTA CTATCCCGAA
GTGCGCTTCA CCACCACGAC CTACGCGCAG GTGGACAGTC GCCTGAGCTT TGGTCATGTG
GCCGATCCCG GCATCTATTC CACCACCGTC ACACGGCCCG ATCTGTTTCG TCATTACCTG
GAGCAGCAAA TCGGGTTGCT GATCAAGAAC CACGACCAGC CGGTGACAGT GGGGGTATCG
CAAACGCCGA TGCCAGTGCA TTTTGCCGTC GCCTCGCGCC CGGGACTGAC AGTACCGCAG
GAGGGCGCCG CGCGCTTCAC CCTGCGCGAT GTCTTTGATG TGCCGGATCT CGCCACAACA
AACGACGACA TCGTAAATGG CACCTATGTG CCCGAAAACG GTGTCGGTCC GCTGTCGCTC
TTTACCGCCC AGCGCGTCGA CTACTCTCTG GCGCGTCTGT CACACTATAC CGCGACCGAC
CCGGAGCATT TCCAGAACCA TGTGTTGTTC ACCAACTACC AGTTCTACGT GACAGAGTTC
GAAGCCTATG CGCGCGAACA ACTGGCCGAT CCAGACAGCG GCTACACCAG TTTTGTCTCC
ACCGGAAACG TCGAGATCAC CAAAGCGGAT GGAGAGCTGC CAGCATCGCT CAAAATGCCG
CAGATGCCGA CTTACCACCT GAAACGCCCC GACGGAAACG GCATCACACT GGTCAACATT
GGCGTTGGCC CGTCAAACGC GAAAACCGCA ACCGATCACA TTGCGGTCCT GCGCCCACAC
GCCTGGCTGA TGGTGGGCCA TTGTGCGGGT CTGAGGAACT CTCAGGCATT GGGAGATTTT
GTGCTTGCTC ATGCCTATCT GCGCGAGGAT CACGTTCTGG ATGACGACCT GCCGGTCTGG
GTACCGATCC CGGCATTGGC AGAAATTCAG GTCGCGCTCG AGGAAGCCGT GGCAGAGGAA
ACCGGACTCG AAGGCTATGA TCTCAAGCGG ATCATGCGGA CTGGCACCGT TGCAAGCCAC
GACAACCGCA ACTGGGAACT GCGCGACCAG TCTGGTCCCG TGCAACGTCT GAGCCAATCC
CGTGCGATTG CTCTTGATAT GGAGAGCGCC ACCATCGCCG CCAATGGCTA TCGGTTCCGG
GTTCCCTACG GCACGCTCCT GTGCATTTCC GACAAGCCCC TGCACGGTGA ATTGAAGCTG
CCCGGCATGG CGTCCGATTT CTATCGCACG CAGGTGTCGC GGCATCTGCT GATCGGGATC
AAGGCAATGG AGCGACTGAG AACAATGCCG CTCGAGCGTC TCCACAGCCG GAAACTGCGC
TCTTTTGACG AAACGGCCTT CCTGTAA
 
Protein sequence
MNDVKTPDLP AAESFTDAKA AVARLIELYD SATNFLCDEF IAAMSAGAAP SSRMRAYYPE 
VRFTTTTYAQ VDSRLSFGHV ADPGIYSTTV TRPDLFRHYL EQQIGLLIKN HDQPVTVGVS
QTPMPVHFAV ASRPGLTVPQ EGAARFTLRD VFDVPDLATT NDDIVNGTYV PENGVGPLSL
FTAQRVDYSL ARLSHYTATD PEHFQNHVLF TNYQFYVTEF EAYAREQLAD PDSGYTSFVS
TGNVEITKAD GELPASLKMP QMPTYHLKRP DGNGITLVNI GVGPSNAKTA TDHIAVLRPH
AWLMVGHCAG LRNSQALGDF VLAHAYLRED HVLDDDLPVW VPIPALAEIQ VALEEAVAEE
TGLEGYDLKR IMRTGTVASH DNRNWELRDQ SGPVQRLSQS RAIALDMESA TIAANGYRFR
VPYGTLLCIS DKPLHGELKL PGMASDFYRT QVSRHLLIGI KAMERLRTMP LERLHSRKLR
SFDETAFL