Gene TM1040_1453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1453 
Symbol 
ID4077750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1551847 
End bp1553010 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content56% 
IMG OID638006764 
Producthypothetical protein 
Protein accessionYP_613448 
Protein GI99081294 
COG category 
COG ID 
TIGRFAM ID[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.445258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCAAA CGCGCTTCTT TAAAATGGCG CAACGAGCAG ATTCAATTTT CGACGGAGGC 
ATCACAGGCA TGCCCACCCA TGACTTTTTT GTCTACACGC CGGATGCGCT GACGTTTTCC
GGCGGTACGA TCCGCGTAAA CTCTGGTTTT GACCCGCTTT CTGATCGGCG CGTTGTGTCT
CTCACAGATG ACGATTCCAC GCTGGATGGG GATTTCACCC GAAATGAGCG CGGTATCGAC
AGCAACCAGC GGGCAACGGT TTTTGAAAGC GATGGCAGCA CATTGGCGCG TATCGGCGGA
GATTTTGTGC AAAACGACCA AGTCTATGCA GAAAATCAAT ATGTGCTGAC CGGCGATGAT
GGCAGCGAAA TTACCGTTTA TGCGCTGGAA AGCGGCGGCA CACTTATTGG CTATCTTCCA
ACTGCGCCGC TTTTACGGGA TGTGAATTAC AGCTACCGGA CCCACAATGT CATCAACGAT
GACGAGCTGA CGGGCTCCTA TCAATATTGG TATCGTCAAT ACCTCGGCGA GGACGCGACC
GATGCCGGAT CCTATGATGA CATTCAGGGG GCCGTGATCG TGTGTTTTAC GCCCAACGCG
CAGGTCACCA CCCCGGAAGG TCCGCGCCGC ATCCGGGATC TTGCCGTGGG CGATCGCGTG
CTGACACGGG ACAATGGTTA TCAGACCCTG CGGTGGAAAT ATCACCGCCG CCTGTCACGC
GCAGATCTCG ACCGTCAACC GCATCTGGCG CCAATCATCT TTGAGCCAGA CGCACTGGCG
CCGGGCTGCC CCAAACGCCG CCTCAAAGTG TCTCCACAGC ACCGGATTCT CATTGAATCC
CACATGAGCG GGCTCCTCTT TGCCAGCAAT GCCATCCTCG CACCGGCCAA GGGGCTGGTG
AACGGCACCA GTGTGCGACA GGATCAAAGT GGCATGCCGG TCACCTATGT GCATCTGATG
TTTGATCGCC ATGAGGTGAT CGAGGCCGAT GGGCTCTATT CCGAAAGCTA CCATCCCGGT
GCCTGGGCGC TTGCGGCTGC CGAAGATGCG GTCCGGCGCG AGCTTTTTGA GATATTCCCG
GCTCTGGAGG CGGATCTCAA TGCCTATGGC CCTACATGTT ATCCCTCTAT AAATGCTCAA
GAAGCACGTC TACTGAAGGC ATGA
 
Protein sequence
MRQTRFFKMA QRADSIFDGG ITGMPTHDFF VYTPDALTFS GGTIRVNSGF DPLSDRRVVS 
LTDDDSTLDG DFTRNERGID SNQRATVFES DGSTLARIGG DFVQNDQVYA ENQYVLTGDD
GSEITVYALE SGGTLIGYLP TAPLLRDVNY SYRTHNVIND DELTGSYQYW YRQYLGEDAT
DAGSYDDIQG AVIVCFTPNA QVTTPEGPRR IRDLAVGDRV LTRDNGYQTL RWKYHRRLSR
ADLDRQPHLA PIIFEPDALA PGCPKRRLKV SPQHRILIES HMSGLLFASN AILAPAKGLV
NGTSVRQDQS GMPVTYVHLM FDRHEVIEAD GLYSESYHPG AWALAAAEDA VRRELFEIFP
ALEADLNAYG PTCYPSINAQ EARLLKA