Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1817 |
Symbol | |
ID | 4076963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1910678 |
End bp | 1911868 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638007132 |
Product | hypothetical protein |
Protein accession | YP_613812 |
Protein GI | 99081658 |
COG category | [S] Function unknown |
COG ID | [COG4260] Putative virion core protein (lumpy skin disease virus) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0944379 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.317367 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGCATTT TTGATTTTCT CAAAGGCGAA TTCATCGACG TCATCCATTG GACCGATGAT ACCCGCGACA CCATGGTCAT GCGTTTCGAG CGCGAGGGCC ATGCGATCAA ATACGGGGCC AAGCTGACGG TCCGCGAGGG TCAGGCGGCC GTGTTTGTGC ATGAAGGCCA GCTGGCGGAT GTGTTCACGC CCGGGCTTTA TATGCTTGAA ACCAACAACA TGCCGGTGCT GACCACGCTT CAGCATTGGG ATCACGGGTT TCAGTCGCCG TTCAAATCCG AGATCTATTT TGTCGCCACC ACGCGGTTCA ATGACCTCAA ATGGGGCACC AAGAACCCGA TCATGTGTCG CGACCCGGAG TTCGGCCCGG TGCGCCTGCG CGCCTTTGGC ACCTATTCGG TGCGGGTGGT GGACCCGGCC CGCTTCCTGA CGGAAATCGT CGGCACCGAT GGTGAGTTCA CCATGGATGA GATCTCTTAC CAGATCCGCA ATATCATTGT GCAGGAGTTC TCGCGCGCGA TTGCCGCTTC TGGCATCCCG GTGCTTGATA TGGCCGCCAA TACCGCCGAT CTGGGCAAGC TGGTCGCCGC CGAGATCGGC CCCGTGGTGG CTGAATACGG CCTCGCGATC CCCGAGCTTT ATGTCGAGAA TATCTCCCTG CCGCCCGCGG TCGAGCAGGC GATGGACAAG CGCACCCAGA TGGGCATCAT CGGCGATCTC GGCCGCTATA CGCAATTCAA GGCTGCCGAA GCCATGGAAG CGGCTGCCAA AACGCCCAAC AGCGGCATGG GCGCCGGGCT TGGGATGGGA ATGGGCATGG CAATGGCACA GCAGATGGGC CACGCGATGC AAGGAGGCGC GACACCTCAG GCAGCGGGTC AACCGACCGG GCCATGGGGC GCACGGCCCG CACCCGCTGC GCCGCAACCT GCGGCTCAGG CCGCACCGAT GGCACCGCCG CCCCCGCCGG TGGAGCATGT CTGGCACATC GCGGAAAACG GTCAGACCTC GGGTCCATAC TCCAAGGCGC GCATGGGGCG CATGGCGCAA GAAGGCCAGC TGACGCGCGA CACCCATGTA TGGACCCCCG GTCAGGACGG CTGGATGCGC GCGGGCGATG TGACCGAGCT GGCACAGCTC TTCACCATCC TGCCGCCGCC CCCACCGCCG CCCCCGCCCG CAGGCGGCTA A
|
Protein sequence | MGIFDFLKGE FIDVIHWTDD TRDTMVMRFE REGHAIKYGA KLTVREGQAA VFVHEGQLAD VFTPGLYMLE TNNMPVLTTL QHWDHGFQSP FKSEIYFVAT TRFNDLKWGT KNPIMCRDPE FGPVRLRAFG TYSVRVVDPA RFLTEIVGTD GEFTMDEISY QIRNIIVQEF SRAIAASGIP VLDMAANTAD LGKLVAAEIG PVVAEYGLAI PELYVENISL PPAVEQAMDK RTQMGIIGDL GRYTQFKAAE AMEAAAKTPN SGMGAGLGMG MGMAMAQQMG HAMQGGATPQ AAGQPTGPWG ARPAPAAPQP AAQAAPMAPP PPPVEHVWHI AENGQTSGPY SKARMGRMAQ EGQLTRDTHV WTPGQDGWMR AGDVTELAQL FTILPPPPPP PPPAGG
|
| |