Gene TM1040_0641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0641 
Symbol 
ID4076128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp683825 
End bp685609 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content63% 
IMG OID638005938 
Productpeptidase M24 
Protein accessionYP_612636 
Protein GI99080482 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.117815 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCAAA ACTTTGATGT CACGGCCCGT CCCGAGCAGG GCCCGGCGCG CCTTGCCGCG 
CTTCGCGCTG AGATGGAGCG AGACAAGATC GACGGTTTTT TGGTCCCGCG CGCCGATGCG
CATCAAGGCG AATACGTCGC CCCACGGGAT GAACGTCTGG CGTGGCTCAC CGGCTTTACC
GGCTCTGCGG GATTCTGCGC CGTGCTGCCG CATATTGCTG GCGTGTTTAT CGACGGGCGC
TATCGCACAC AGGTCAAAGG CCAAGTCGCG GATGTCTATA CCCCTGTTCC TTGGCCGGAT
GTGACTTTGG GCGATTGGCT GGTGGAGCAA CTGCCAGAGG GCGGGATTGT CGCCTATGAC
CCCTGGCTGC ATTCCCTGCA GGAGATCCGC GACCTGACCG AGCGGCTCGT CTCGTCGGAT
ATTTCACTGG TGGAAAGCGA CAATCTAGTA GACCGCATCT GGCCCGACCA GCCAGCGCCC
CCGATGCAAC CGGCTCGGGC GCATTCGGAG GACTATGCCG GAGAGAGCGC CGAGAAGAAG
GCTCAGCGCC TGGCCGAGGG CCTGCGTAAA AGCGGACAGT CCGCAGCGGT CATCACTCTT
CCAGACAGCA TCATGTGGCT CCTGAATATC CGTGGTTCTG ACATTCCGCG CAATCCGGTC
GCCCATGCTT TTGCGATCCT GCATGATGAC GCCCGAGTGG ACCTGTTTAT GGCAGCAGAG
AAGCTCTCTG AGCTCGTATT GGGCGCGCAT GTGACCCTAC ACGCGCCTGA TCGCTTTCTC
GAGGCCACAG CCGGCCTCAA TGGTCAGGTC GCGGTGGACG CGCGCAGCCT GCCACAGGCT
GTTGCACGGG TTTTGGGCGA CAGGCTGGCG GCGGTCGGAG ACCCCTGCGC CCTGCCAAAG
GCCCGCAAGA ACGCCGCCGA GATCGCAGGC AGCGCGGCCG CACATCTGCG CGATGGGGCT
GCCGTTGTCG AAACGCTGGC TTGGCTCGAT ACGCAGGAAC CGGGCACGAT TACCGAAATC
GACGTGGTCA AGACACTCGA AGGGTTCCGC GCGGCAGATC CCGCGTTGCG TGACATCAGC
TTTGAAACCA TCGCAGGGAC AGGCGCCAAT GGCGCAATCA TGCATTACCG TGTGACACAT
GATACCAATG CGACGCTCCA AGAGGGTCAT CTTCTGGTGC TCGACAGCGG CGGGCAATAT
CTCGATGGCA CCACCGACAT CACCCGCACC ATCGCCATTG GATCGCCGGG CCGTGAGGAA
GCCGAAGCCT TTACGCGCGT CCTGCAGGGC ATGATCGCGG TCTCCCGGTT GCGTTGGCCC
GAGGGGCGGT CCGGGCGCGA ACTTGAGGCT ATCGGCCGCC TGCCTCTCTG GATGGCTGGA
CAGGATTTCA ACCATGGGCT TGGTCATGGC GTCGGCGCCT TCCTCAGCGT ACATGAAGGA
CCGCAGGGTC TTTCCCGCAT TAATACGGTA CCGCTTGAGC CGGGCATGAT CCTGTCCAAC
GAGCCGGGCT ACTACCGGGA GGGCGCCTTT GGCATCCGGA TTGAAAACCT CGTAGTGGTA
GAAGAGGCCC CAGCCCTTGA CACCGCCGAC CCAGACCGCA AGATGCTCGC GTGGCGCACG
CTGACGTTTG CGCCCATCGA CCGCCGTTTG GTGGTGCCCG AGATGCTGAG CTCGGGGGAG
CGCGAGTGGC TCAATAGCTA TCACGCAGAG GTAAACCGCA CTATCGCGCC GCGCGTCAGC
GCCGCTGCGG CAGAGTGGTT GAACGCGGCC TGCGCGCCGC TGTGA
 
Protein sequence
MFQNFDVTAR PEQGPARLAA LRAEMERDKI DGFLVPRADA HQGEYVAPRD ERLAWLTGFT 
GSAGFCAVLP HIAGVFIDGR YRTQVKGQVA DVYTPVPWPD VTLGDWLVEQ LPEGGIVAYD
PWLHSLQEIR DLTERLVSSD ISLVESDNLV DRIWPDQPAP PMQPARAHSE DYAGESAEKK
AQRLAEGLRK SGQSAAVITL PDSIMWLLNI RGSDIPRNPV AHAFAILHDD ARVDLFMAAE
KLSELVLGAH VTLHAPDRFL EATAGLNGQV AVDARSLPQA VARVLGDRLA AVGDPCALPK
ARKNAAEIAG SAAAHLRDGA AVVETLAWLD TQEPGTITEI DVVKTLEGFR AADPALRDIS
FETIAGTGAN GAIMHYRVTH DTNATLQEGH LLVLDSGGQY LDGTTDITRT IAIGSPGREE
AEAFTRVLQG MIAVSRLRWP EGRSGRELEA IGRLPLWMAG QDFNHGLGHG VGAFLSVHEG
PQGLSRINTV PLEPGMILSN EPGYYREGAF GIRIENLVVV EEAPALDTAD PDRKMLAWRT
LTFAPIDRRL VVPEMLSSGE REWLNSYHAE VNRTIAPRVS AAAAEWLNAA CAPL