Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3087 |
Symbol | |
ID | 4075533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 54543 |
End bp | 55976 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638004588 |
Product | hypothetical protein |
Protein accession | YP_611323 |
Protein GI | 99078065 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGACA TCGACATTGT CAGATCCGAC AACGATCCGC GCCAGAGCGA TCACGGCATC CGCACCGATA CCGCGTCGCG CGGGGTGGAC CTCGAAGGGG CGAAGATCAC CGTCACCTAT ACCGACGGAA CCACCGAGAC GCTCACCTGG AAAGCGCTTG ATCCCTATAC TGCTGGGGGC GCGACCGGTG ACAATATCGA CATGTACTTC GGCTATGACT GGCACCAGCT GACCACGACA AAGCCCCTCG CCTCGCTCAA GATCGATCTG GCCCCGGCAA ACTCCGTGTT CGACACAACC TTTGCCAGTG ATGGCAACCC GAACGATCCC TCCACTCCAG GCTCCAAAGA AGGCTTTCCC TTCAAGGTCT CGCCCGACTA CGAGGACCTT GGCGGCACGA TCACCGCGAC CTACTCCGGG ATCGCGGGGC TCGATGGCAA TGCGCCCGTA GGGGATCTCT ACACCACGAT GACGCTGGAT TTCACCAGCC TGCCCGGCGG TGGCTTGCAG GGGGATCTGG TCTGGAACTC CGATATCGAC ACGCTTACCG GTCCCTTTGC CTCTTCTGTG ACCGCGACGG ACGACTTTGT TTCCATCTCC GGGAATGGCT CGGAAGTGAT TGATATTCTC GGCAATGACG CCGGGGCGGG CAAGGGTCCT CTGACCATTT CCCATATCGC GGGCACCGCA ATATCGGCCG GAGAGTCGGT GACGCTCGCA AGCGGTGAGG TGATTACCCT CAATCCCGAC GGAACCTTGT CGATCACCAA CGACAGCCCC GAGGATGAAA CCAACAGCTT CACCTACACA GTGGTGGATG AGGCAGGAAA TACCGCGACC GGCACCGTCA CGGTTGACAC CAAACTCACA GCGCCCCCCT GCTTTGTCGC GGGCACGTTG ATCGACACCG AAGCGGGGCC GATCGCGGTC GAAGATCTCA CCGTCGGCAT GCAGGTGATG ACGCGGGACC ATGGGCCACA ACCACTGCGC TGGATCGGTC GCAGCACCCG ACTCGCGCGG GGCCGCAACG CACCAGTGCT GATCGCCCGC AACGCTTTGG GCCACCATGG CCAGATCGCA CTTTCGCCCA ATCACCGCGT CCTGATCCGA TCAGAGCTCG CAGATCTGCA ATTTGGCCAG CCCGAGGTGC TCATCAAGGC GAAGCATCTG GTCAATGGAG ACAGCATTCG GATTCTCAGC GATGGCACGC AGGTGTGCTA TGTGCACATC CTCTTTGACC GCCACGAGAT CGTGCGTGCC AATGGTCTGG ACAGCGAGAG CTACCACCCC GGCGCTGAAA CCCTTGGTGC TTTTGACGCG GACACCCGAG CCGAGATTCT CGACCTCATG TCAGACTGGC AAGAGTATGG ACCTGCTGCA CGGATCACGC TCAAGCCGCG CGAATGTGCG CTGCTTCAAC TAAACCGACA CTAA
|
Protein sequence | MPDIDIVRSD NDPRQSDHGI RTDTASRGVD LEGAKITVTY TDGTTETLTW KALDPYTAGG ATGDNIDMYF GYDWHQLTTT KPLASLKIDL APANSVFDTT FASDGNPNDP STPGSKEGFP FKVSPDYEDL GGTITATYSG IAGLDGNAPV GDLYTTMTLD FTSLPGGGLQ GDLVWNSDID TLTGPFASSV TATDDFVSIS GNGSEVIDIL GNDAGAGKGP LTISHIAGTA ISAGESVTLA SGEVITLNPD GTLSITNDSP EDETNSFTYT VVDEAGNTAT GTVTVDTKLT APPCFVAGTL IDTEAGPIAV EDLTVGMQVM TRDHGPQPLR WIGRSTRLAR GRNAPVLIAR NALGHHGQIA LSPNHRVLIR SELADLQFGQ PEVLIKAKHL VNGDSIRILS DGTQVCYVHI LFDRHEIVRA NGLDSESYHP GAETLGAFDA DTRAEILDLM SDWQEYGPAA RITLKPRECA LLQLNRH
|
| |