Gene TM1040_1886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1886 
Symbol 
ID4077383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1986308 
End bp1987381 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content63% 
IMG OID638007202 
Productintegral membrane protein-like 
Protein accessionYP_613881 
Protein GI99081727 
COG category[S] Function unknown 
COG ID[COG5480] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTCC CGGCTGTTAG GCGCCTGTGC CTGCTGGCCG CTACGCTCGC CTTGCCGTCT 
GCGGGGCAGG CGGCGCTTGA CCTGTGCAAC GACACCACCG CGGCGCAGCG GGTGGCAATC
GGCTTTCAGG AGGCGGGCGA CTGGACCTCA AAAGGCTGGT GGGATCTGCC TGCTGGCAGC
TGTACAGAGG TGCTCTCTTC CGCGTCGACA AGCCGGTTTT ATTATCTGCG GGTAGAGACC
GAGGGCTGGG CCTTCACCGA TGACAGGCTT GGGTTTTGCG TTGCTGACAC GGATTTTGAA
ATCAAGGGCG AAGATGGGTG CGCGCGCCGT GGCTTTCGCC AGGAGAATTT CGCACGGATC
GACACCAGGG GCGCCGCTGC ACCCGACCCA ACAGCCCAGA CGCGGCCAGC CGCAGATCCG
GACACTGGGT CCGACGCCGC GCGCAGGACT TTTACACACC ACCTCAGCGC CCATCTCACT
CCGATCAAAT CCGACGTCCC GGCGCGCCAT GTGGTTCAGT CCGGGTTCAG CACCAAAGCC
GTGTTTCAAG GCTGTGATGC GGAAACCACC GCCTATGCCA GCTTTTGCAC CTTCATCGGC
GCCGGGCGGC GCTATCTGGT TTATGACGAC GGGCGCACCT CCGCCGCGCT CTGGCAGGAG
ATCCAGCAAG CCATCCGCGG GCGACGCTAC ACGCTCGAGG GTTTGCGAGA GGATCTTTTT
GACACCACGT CAGAACTGGT TCTGCGCGCC ATCCGACCAG AGCCCGAGGA TCGCTCGGAT
CAGCTCTTGG CCAGCCTGCA GGGTCTCTGG CGCTCCACCA TCGATCCCAA TGACAGCTTT
CGCGTCAGCG GGGCCGAGCG CATAAACGCC TATGCCGGGG CGGAAACATC GGTCGAATAC
CTCTCATTGC ACAAACCCTG CGCCGAGGCG GGCGACGTGG GTCCCTTCCT TTTTACCTGG
GACAACAATT CCGGCACCAG CCTGTGCTAC GCCATTGCCC GCCTCACTGA CACCGAACTG
GCGCTGATCT ACCTTCCGCG CGGGACCGAA CTGGTCTACC GCCGCGAGGG CTGA
 
Protein sequence
MSFPAVRRLC LLAATLALPS AGQAALDLCN DTTAAQRVAI GFQEAGDWTS KGWWDLPAGS 
CTEVLSSAST SRFYYLRVET EGWAFTDDRL GFCVADTDFE IKGEDGCARR GFRQENFARI
DTRGAAAPDP TAQTRPAADP DTGSDAARRT FTHHLSAHLT PIKSDVPARH VVQSGFSTKA
VFQGCDAETT AYASFCTFIG AGRRYLVYDD GRTSAALWQE IQQAIRGRRY TLEGLREDLF
DTTSELVLRA IRPEPEDRSD QLLASLQGLW RSTIDPNDSF RVSGAERINA YAGAETSVEY
LSLHKPCAEA GDVGPFLFTW DNNSGTSLCY AIARLTDTEL ALIYLPRGTE LVYRREG