Gene TM1040_1817 details


Gene Information

Locus tag: TM1040_1817
Symbol:
ID: 4076963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1910678
End bp: 1911868
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 63%
IMG OID: 638007132
Product: hypothetical protein
Protein accession: YP_613812
Protein GI: 99081658
COG category: [S] Function unknown
COG ID: [COG4260] Putative virion core protein (lumpy skin disease virus)
TIGRFAM ID:
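
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic; it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon (both are assumptions, though they are consistent with the values in this record). The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1910678
end_bp = 1911868

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1191 0 396
```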


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0944379
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.317367
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGCATTT TTGATTTTCT CAAAGGCGAA TTCATCGACG TCATCCATTG GACCGATGAT 
ACCCGCGACA CCATGGTCAT GCGTTTCGAG CGCGAGGGCC ATGCGATCAA ATACGGGGCC
AAGCTGACGG TCCGCGAGGG TCAGGCGGCC GTGTTTGTGC ATGAAGGCCA GCTGGCGGAT
GTGTTCACGC CCGGGCTTTA TATGCTTGAA ACCAACAACA TGCCGGTGCT GACCACGCTT
CAGCATTGGG ATCACGGGTT TCAGTCGCCG TTCAAATCCG AGATCTATTT TGTCGCCACC
ACGCGGTTCA ATGACCTCAA ATGGGGCACC AAGAACCCGA TCATGTGTCG CGACCCGGAG
TTCGGCCCGG TGCGCCTGCG CGCCTTTGGC ACCTATTCGG TGCGGGTGGT GGACCCGGCC
CGCTTCCTGA CGGAAATCGT CGGCACCGAT GGTGAGTTCA CCATGGATGA GATCTCTTAC
CAGATCCGCA ATATCATTGT GCAGGAGTTC TCGCGCGCGA TTGCCGCTTC TGGCATCCCG
GTGCTTGATA TGGCCGCCAA TACCGCCGAT CTGGGCAAGC TGGTCGCCGC CGAGATCGGC
CCCGTGGTGG CTGAATACGG CCTCGCGATC CCCGAGCTTT ATGTCGAGAA TATCTCCCTG
CCGCCCGCGG TCGAGCAGGC GATGGACAAG CGCACCCAGA TGGGCATCAT CGGCGATCTC
GGCCGCTATA CGCAATTCAA GGCTGCCGAA GCCATGGAAG CGGCTGCCAA AACGCCCAAC
AGCGGCATGG GCGCCGGGCT TGGGATGGGA ATGGGCATGG CAATGGCACA GCAGATGGGC
CACGCGATGC AAGGAGGCGC GACACCTCAG GCAGCGGGTC AACCGACCGG GCCATGGGGC
GCACGGCCCG CACCCGCTGC GCCGCAACCT GCGGCTCAGG CCGCACCGAT GGCACCGCCG
CCCCCGCCGG TGGAGCATGT CTGGCACATC GCGGAAAACG GTCAGACCTC GGGTCCATAC
TCCAAGGCGC GCATGGGGCG CATGGCGCAA GAAGGCCAGC TGACGCGCGA CACCCATGTA
TGGACCCCCG GTCAGGACGG CTGGATGCGC GCGGGCGATG TGACCGAGCT GGCACAGCTC
TTCACCATCC TGCCGCCGCC CCCACCGCCG CCCCCGCCCG CAGGCGGCTA A
 
Protein sequence
MGIFDFLKGE FIDVIHWTDD TRDTMVMRFE REGHAIKYGA KLTVREGQAA VFVHEGQLAD 
VFTPGLYMLE TNNMPVLTTL QHWDHGFQSP FKSEIYFVAT TRFNDLKWGT KNPIMCRDPE
FGPVRLRAFG TYSVRVVDPA RFLTEIVGTD GEFTMDEISY QIRNIIVQEF SRAIAASGIP
VLDMAANTAD LGKLVAAEIG PVVAEYGLAI PELYVENISL PPAVEQAMDK RTQMGIIGDL
GRYTQFKAAE AMEAAAKTPN SGMGAGLGMG MGMAMAQQMG HAMQGGATPQ AAGQPTGPWG
ARPAPAAPQP AAQAAPMAPP PPPVEHVWHI AENGQTSGPY SKARMGRMAQ EGQLTRDTHV
WTPGQDGWMR AGDVTELAQL FTILPPPPPP PPPAGG
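
The protein sequence begins with M even though the gene sequence begins with TTG, which normally encodes leucine. Under translation table 11, TTG can act as an initiation codon and is then read as methionine. The Python sketch below illustrates this. It builds the standard codon table (table 11 uses the same codon-to-amino-acid assignments) and treats ATG, GTG and TTG as start codons, which is a simplifying assumption covering only the common bacterial starts. The function names `translate_cds` and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
# Standard genetic code in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip([a + b + c for a in BASES for b in BASES for c in BASES], AMINO)
)

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a leading start codon as M
    and stopping at the first stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # initiation codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First 60 bases of the gene sequence above.
first_line = "TTGGGCATTT TTGATTTTCT CAAAGGCGAA TTCATCGACG TCATCCATTG GACCGATGAT"
print(translate_cds(first_line))  # MGIFDFLKGEFIDVIHWTDD
```

Passing the full gene sequence to `translate_cds` should give the protein sequence listed above, and `gc_fraction` on the same input can be compared against the GC content reported in the record.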