Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3334 |
Symbol | |
ID | 4075233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 344066 |
End bp | 345592 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638004842 |
Product | hypothetical protein |
Protein accession | YP_611568 |
Protein GI | 99078310 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.881779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.501512 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAGG ATCTAATCCT TCACGGCGTT CCAGATCGCA TCAGGTTGGA CGGGTGGTGG CTCGAGACGA CAGAAACACC AGAGCCGGAC ATCGCGCCGC CTATGAGGGC AAATGGCTCG TTGATGGCGT CATTGCGCAT GCGGCAGGTG ATCCCCCTGC TTGGCGCGCT TGTGGCACTT GCGGATTGGC TGTTCTGGCA TCAGCCGGTC GGTCTCTCAC TCGCCATTTT TGCCGTCGTT GTCTCCGCAG CGATTCTGGC GGTGAAACCT GAGCGACCAA GCTTACGGAG CTGGGGCCTC GCCATGGGGT TTGCGTTGCT TTGCAATCTG CCAGTGGTGA TCGAGCTTCA ATTTCTGTCA CTCCTGTTTA GCCTCGGAGG TCTCATCACG CTTGCGGCTT GGGCGTTTGC GGGGTCCAGC TTGACGACGG GGCTCATACT GAGGATGGCA CTTCGCCTCC CTGCCTTTGG GCTAGTACAT TTGGTGAAAG ATACCGCCGA TGCGCTACCG CCTGCAACAT ACTCGTCACG GCTCCGGTAC ATGGCGGCCA CGCTGCTATT GCCGCTTCTG ATGGGGGCCG TGTTTCTGGG CCTTCTCGCA AATGCCAACC CAGTCCTACA GGCGGCGCTG GACAGCATCG ATCTGCGCCA CCTGCTCAGG GCTGAATTTT GGACGCGCTT TCTGTTCTGG GGGTGCGTGG CGTCACTCCT CTGGCCGCTC CTCAATCTGA GCGAGTCATG GATTGGGGCA CAAGCCCGCC GAGCGCGCGT AACAAAGGCG GGTCCGCACC GTAGCAGCTT CTTGATCAAT CCGCTTTCCG TGCGCAATTC GCTTTGGCTG TTCAATTTGA TGTTTGGCAT CCAGACCCTG ATGGACCTCA GCATATTAAC CGGCGGAGTG TCGCTGCCCG AGGGCATGAG CTATGCCTCA TATGCACATC GCGGCGCCTA TCCCCTTGTG GCAACGGCGC TGCTCGCCGG ACTCTTTACA CTGCTCACGC GAAATATGAT TGGCCAAGAC AAGGTTCTGC GGTCTCTGGT CTATCTGTGG CTGGCGCAGA ACATGATACT TGTTGCAACA GCCGCGATCC GATTGCAGCA CTATGTTGAG GCCTACGCCC TTACCTACCT GCGTGTCGCG GCATTTATCT GGATGGCTCT GGTTCTGACA GGGCTGCTGT TGACGATCTG GCAAATCCAT CGCGGGTTTG GGACATCATG GCTATTGCGA CGGTGCTTGG CTGCGCTCGC CATCACGCTC TACCTCTCCA GCCTCACAAA TTTTGCCGAC ATAGTCGCCA GATATAATCT CACCCATGGC AGCGCGCTTC GGGGGCCTGA CACCTACTAT ATCTGCAGTC TCGGTCCGGG GGCCTACCGC ACGATACTGG ATCATGAAGC AAGCACCGGA CAGGATATTT GCACACGCAT GATTGAACGC GACCTCGAGC GCATTTCAAT CCGGAACTGG CGCGAATGGG GCTATCGGAT GTGGCGGCTT GAGGCCTATG ATCGGGCGCA GAATTGA
|
Protein sequence | MAQDLILHGV PDRIRLDGWW LETTETPEPD IAPPMRANGS LMASLRMRQV IPLLGALVAL ADWLFWHQPV GLSLAIFAVV VSAAILAVKP ERPSLRSWGL AMGFALLCNL PVVIELQFLS LLFSLGGLIT LAAWAFAGSS LTTGLILRMA LRLPAFGLVH LVKDTADALP PATYSSRLRY MAATLLLPLL MGAVFLGLLA NANPVLQAAL DSIDLRHLLR AEFWTRFLFW GCVASLLWPL LNLSESWIGA QARRARVTKA GPHRSSFLIN PLSVRNSLWL FNLMFGIQTL MDLSILTGGV SLPEGMSYAS YAHRGAYPLV ATALLAGLFT LLTRNMIGQD KVLRSLVYLW LAQNMILVAT AAIRLQHYVE AYALTYLRVA AFIWMALVLT GLLLTIWQIH RGFGTSWLLR RCLAALAITL YLSSLTNFAD IVARYNLTHG SALRGPDTYY ICSLGPGAYR TILDHEASTG QDICTRMIER DLERISIRNW REWGYRMWRL EAYDRAQN
|
| |