Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1965 |
Symbol | |
ID | 4077149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2069025 |
End bp | 2069999 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638007280 |
Product | hypothetical protein |
Protein accession | YP_613959 |
Protein GI | 99081805 |
COG category | [R] General function prediction only |
COG ID | [COG5006] Predicted permease, DMT superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.124551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.159447 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAGA CCTCAGCCAC ACCCGACACC ACCGACCGCC CGCTTTTGGG GATTGCCCTG ATGCTTGGGT TTTGCGCCCT CATTCCGCTC GGAGATGCGG TGGCAAAGCT GTTGTCGACA CGAATCCCGG TGGGGCAGAT CGTGTTTGTC CGCTTCGCCG CTCAGGGTGT AATCTTGGCG CCGGTGGCGC TGATGCTAGG GATCTCGCTG CGCTTGCCGC GCCGCATTCT GCCCATCGTG TTGCTGCGCA CCCTGCTGCA AATGGGTGGT ATCACTGCAA TGTTTATGGC GCTGCGCTTC CTGCCGCTGG CGGATGCGGT GGCGATTGCC TTTGTGATGC CTTTCATCAT GCTGCTTTTG GGCAAATACG TCCTCAAGGA AGAGGTCGGC CTGCGCCGTC TTCTGGCCTG CGTCGTGGGC TTTGCGGGTA CGCTTCTGGT GATCCAGCCC AGCTTTGCGG CCGTCGGGCT CAACGCGCTC TGGCCCTTGG CGGTGGCGGT CATCTTTGCG GTGTTCATGA TGGTCACCCG CACCATCGCG CGCGACACCG ACCCGATTGC CATTCAGGCG GTCTCTGGCG GCATTGCCTC CGTGCTGATG GCCATACTCT TTGTGCTTGG GGCGCAATTC GAGGTCGCCG AGCTGGCGAC CAACCTGCCC GCGCGTCCCG AGATTAACCT GCTGCTCCTG GCAGGGCTTT TTGGCACCAT CGCGCATCTC CTGATGACGT GGTCGCTGCG CTATGCGCCC ACAAGCACCC TCGCCTCGAT GCAATATCTG GAGATCCCCG TGGCGGTCTT TGTCGGCTGG CTGTTCTTTG CGGAACTGCC CAACACGATC GCCGCCTGCG GCATCGCGCT CACGATGGCG GCGGGGCTTT ATGCCGTGAT GCGCGAACGC CAGGTGAGCC GAGCTGCGCG CAAAGTCGAT CCGACACCGA TCAGCAAGAC GGGCCTGCCT GCATCACCTG AATAA
|
Protein sequence | MTQTSATPDT TDRPLLGIAL MLGFCALIPL GDAVAKLLST RIPVGQIVFV RFAAQGVILA PVALMLGISL RLPRRILPIV LLRTLLQMGG ITAMFMALRF LPLADAVAIA FVMPFIMLLL GKYVLKEEVG LRRLLACVVG FAGTLLVIQP SFAAVGLNAL WPLAVAVIFA VFMMVTRTIA RDTDPIAIQA VSGGIASVLM AILFVLGAQF EVAELATNLP ARPEINLLLL AGLFGTIAHL LMTWSLRYAP TSTLASMQYL EIPVAVFVGW LFFAELPNTI AACGIALTMA AGLYAVMRER QVSRAARKVD PTPISKTGLP ASPE
|
| |