Gene Information |
Locus tag | TM1040_1921 |
Symbol | |
ID | 4076872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2022229 |
End bp | 2023398 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638007237 |
Product | peptidase M24 |
Protein accession | YP_613916 |
Protein GI | 99081762 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
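The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2022229
end_bp = 2023398
gene_length_bp = 1170
protein_length_aa = 389


def span_length(start: int, end: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 2023398 - 2022229 + 1 = 1170 bp,
# and 1170 / 3 - 1 = 389 aa.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

The second check only applies because the record marks the gene as an unspliced CDS that is not a pseudogene.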
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.927286 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.795034 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCTGCCA ACGATTTTAC CACAACCGAA TACGCCCGCA GACTGGAAAA GACCCGCCGC GCCATGGCCG CAAAGGGTCT TAAAACACTG GTGATCTGCG ATCCCTCGAA CATGGCATGG CTCACCGGAT ATGACGGCTG GAGCTTCTAT GTGCCTCAGG CCGTGATTCT CCATGAGGAC GGCATGCCCA TGTGGTGGGG CCGCACCCAG GACAAACCCG GTGCGGCGCT TACGACCTGG CTTGAGGCGG ATCATCTCTT TGACTGGCCC GAAGAACACG TCCAGCACCC GGATCACCAC CCGTTTGACT CGCTGGTGGA TCTACTGCGG GAATTTGGCT GGACCGAGGC CATTGGCGTC GAAATGGACA ACTACTATTA CTCGGCGCGC AGCCACGAGA TCCTCGAGAC CGCCTTTGGT CGCGATGCTT TTGCCGATGC TACCGGCCTT GTGAACTGGC AGCGCGCCGT CAAGAGCGAG CAGGAACTGC AGCTGATGCA AGCCGCTGGT AAGCTCTCGG CGCATATGCA CGGGGTACTG CGCGCGGAAT TCAACGAAGG CATCGCCAAG AACGCGCTGG TGGCGCGGGT GCAGGCCGCA GGTATCGAAG GGCTGCCGCT TCTGGCGGGC GACTATCCGG CAATCTCGCC GATTGCGCCC TCGGGGATCG AGGCCTCAGC CTCGCATATC ACCTGGAACG ACCGCCCCCT GGCCCCCGGC GAGGCAACCT ATTTCGAAAT CTCGGGATGT GTGCGCCGCT ATCACTGCCC GATCAGCCGC ACGCTGTTCC TCGGGAGCCC GCCCGAAGAC ATCCGCCGTG GTGAAAACGC GATCCTGCAG GCCATCGAGG ACACCTTTGC CGTCGCCAAA CCCGGTGTCA CCTGCGAAGA GGTTGCGGCC TGCGTCTATG AGAGCTTTGG TCGCGCGGGC TACATCAAGG GCAACCGCAC CGGGTATCCC GTGGGCCTCA GCTATCCGCC GGACTGGGGC GAGCGCACCA TGTCACTGCG TCCGGGTGAC ACTACCAAGC TTGAGGAAAA CATGACCTTC CACCTGATGC CGGGTCTCTG GACACCCGAT TGGGGCATGG CCATCACCGA GACCTTCGTG GTGACCCCCA ATGGCGGTGA GCCTCTGGCG GATGTCCCGC GTGAAATCGT GGTGAAATAA
|
Protein sequence | MPANDFTTTE YARRLEKTRR AMAAKGLKTL VICDPSNMAW LTGYDGWSFY VPQAVILHED GMPMWWGRTQ DKPGAALTTW LEADHLFDWP EEHVQHPDHH PFDSLVDLLR EFGWTEAIGV EMDNYYYSAR SHEILETAFG RDAFADATGL VNWQRAVKSE QELQLMQAAG KLSAHMHGVL RAEFNEGIAK NALVARVQAA GIEGLPLLAG DYPAISPIAP SGIEASASHI TWNDRPLAPG EATYFEISGC VRRYHCPISR TLFLGSPPED IRRGENAILQ AIEDTFAVAK PGVTCEEVAA CVYESFGRAG YIKGNRTGYP VGLSYPPDWG ERTMSLRPGD TTKLEENMTF HLMPGLWTPD WGMAITETFV VTPNGGEPLA DVPREIVVK
|
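The GC content field can, in principle, be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted in as printed, in space-separated 10-base blocks; `gc_fraction` is a hypothetical helper name, and the database's own rounding rule is not stated in this record.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide string.

    Whitespace (such as the 10-base block separators used above)
    is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Usage with the first two blocks of the gene sequence above.
# Paste the full "Gene sequence" value to compare with the GC content field.
fragment = "ATGCCTGCCA ACGATTTTAC"
print(f"GC fraction of fragment: {gc_fraction(fragment):.2f}")
```

Multiplying the result by 100 gives a percentage comparable to the GC content field.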