Gene TM1040_1921 details

Gene Information

Locus tag: TM1040_1921
Symbol:
ID: 4076872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2022229
End bp: 2023398
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 62%
IMG OID: 638007237
Product: peptidase M24
Protein accession: YP_613916
Protein GI: 99081762
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
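
The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon, the arithmetic can be checked as follows. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2022229, 2023398)
print(length, protein_length(length))  # 1170 389
```

Both results agree with the Gene Length and Protein Length values listed in the record.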


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.927286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.795034
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGCCA ACGATTTTAC CACAACCGAA TACGCCCGCA GACTGGAAAA GACCCGCCGC 
GCCATGGCCG CAAAGGGTCT TAAAACACTG GTGATCTGCG ATCCCTCGAA CATGGCATGG
CTCACCGGAT ATGACGGCTG GAGCTTCTAT GTGCCTCAGG CCGTGATTCT CCATGAGGAC
GGCATGCCCA TGTGGTGGGG CCGCACCCAG GACAAACCCG GTGCGGCGCT TACGACCTGG
CTTGAGGCGG ATCATCTCTT TGACTGGCCC GAAGAACACG TCCAGCACCC GGATCACCAC
CCGTTTGACT CGCTGGTGGA TCTACTGCGG GAATTTGGCT GGACCGAGGC CATTGGCGTC
GAAATGGACA ACTACTATTA CTCGGCGCGC AGCCACGAGA TCCTCGAGAC CGCCTTTGGT
CGCGATGCTT TTGCCGATGC TACCGGCCTT GTGAACTGGC AGCGCGCCGT CAAGAGCGAG
CAGGAACTGC AGCTGATGCA AGCCGCTGGT AAGCTCTCGG CGCATATGCA CGGGGTACTG
CGCGCGGAAT TCAACGAAGG CATCGCCAAG AACGCGCTGG TGGCGCGGGT GCAGGCCGCA
GGTATCGAAG GGCTGCCGCT TCTGGCGGGC GACTATCCGG CAATCTCGCC GATTGCGCCC
TCGGGGATCG AGGCCTCAGC CTCGCATATC ACCTGGAACG ACCGCCCCCT GGCCCCCGGC
GAGGCAACCT ATTTCGAAAT CTCGGGATGT GTGCGCCGCT ATCACTGCCC GATCAGCCGC
ACGCTGTTCC TCGGGAGCCC GCCCGAAGAC ATCCGCCGTG GTGAAAACGC GATCCTGCAG
GCCATCGAGG ACACCTTTGC CGTCGCCAAA CCCGGTGTCA CCTGCGAAGA GGTTGCGGCC
TGCGTCTATG AGAGCTTTGG TCGCGCGGGC TACATCAAGG GCAACCGCAC CGGGTATCCC
GTGGGCCTCA GCTATCCGCC GGACTGGGGC GAGCGCACCA TGTCACTGCG TCCGGGTGAC
ACTACCAAGC TTGAGGAAAA CATGACCTTC CACCTGATGC CGGGTCTCTG GACACCCGAT
TGGGGCATGG CCATCACCGA GACCTTCGTG GTGACCCCCA ATGGCGGTGA GCCTCTGGCG
GATGTCCCGC GTGAAATCGT GGTGAAATAA
 
Protein sequence
MPANDFTTTE YARRLEKTRR AMAAKGLKTL VICDPSNMAW LTGYDGWSFY VPQAVILHED 
GMPMWWGRTQ DKPGAALTTW LEADHLFDWP EEHVQHPDHH PFDSLVDLLR EFGWTEAIGV
EMDNYYYSAR SHEILETAFG RDAFADATGL VNWQRAVKSE QELQLMQAAG KLSAHMHGVL
RAEFNEGIAK NALVARVQAA GIEGLPLLAG DYPAISPIAP SGIEASASHI TWNDRPLAPG
EATYFEISGC VRRYHCPISR TLFLGSPPED IRRGENAILQ AIEDTFAVAK PGVTCEEVAA
CVYESFGRAG YIKGNRTGYP VGLSYPPDWG ERTMSLRPGD TTKLEENMTF HLMPGLWTPD
WGMAITETFV VTPNGGEPLA DVPREIVVK
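
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one possible way to do that in plain Python. It assumes the sequence is pasted in the space-separated blocks of ten shown above, and it uses the standard codon assignments, which translation table 11 shares for all non-initiating codons. The helper names (clean, gc_fraction, translate) are hypothetical, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments and differs only in which codons may initiate.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence listed above.
first_row = "ATGCCTGCCA ACGATTTTAC CACAACCGAA TACGCCCGCA GACTGGAAAA GACCCGCCGC"
last_row = "GATGTCCCGC GTGAAATCGT GGTGAAATAA"
print(translate(first_row))  # MPANDFTTTEYARRLEKTRR
print(translate(last_row))   # DVPREIVVK
```

Passing the full gene sequence to gc_fraction gives a value that can be compared with the GC content field of the record.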