Gene TM1040_1324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1324 
Symbol 
ID4078367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1409890 
End bp1412040 
Gene Length2151 bp 
Protein Length716 aa 
Translation table11 
GC content48% 
IMG OID638006632 
Producthypothetical protein 
Protein accessionYP_613319 
Protein GI99081165 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.772779 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCC CTCGCTGGAG GTGGAAGAGT ATCTATACTA CAAACTATGA CACTCTCGTA 
GAACAATCGT TCGATAGTGC CAAAAAGAGG TGTCGAGTAT ATTCTTCTAA CTTTGACTTC
CATATAGATG AGGAAGAATA CGATTGCGAG CTATTTAAGA TTCATGGGAC AATTGAAAAA
GATATTTCAA CCGGCCATCA TAGCAGAATT ATCCTGACTG TGGATGACTA CGATCATACT
GAAGAGTTTC GAGAACAGCT TTATGATCGG ATGCGCGGCG ATCTGGCAGG TGCCGATCTT
GTGATAATTG GGCAATCACT GTCCGATCCA GACATGGATA CTCTGGTGAG GCGGGCTGTA
AAGATCAATG AAAAAGCGCT ATCCCCTGGT CAGATCACAC TTCTGATCTA CTCTGAAAAC
CAAAGTCGTG CGCTCCTTCA AGAACGCAAA GGGTTGCGTG TAGTCTTCGG CGGAATTGAC
GAGTTCTTCA CGCGACTCGA CAAAAGACCG CCAAAGATCG CACCCGTCAT GATTTCGAAA
GACGACGAGG CGCTAATCGG CACGCATATA TCAAACTCTA TTGTCGAAGT GTCGGAAGTT
TCCAATGGAG CCGAAGCCGA CGTCAGTTCG ATGTTCAACG GCTGGCCTGC AACTCACCGA
GAGGTCGAAG CGGGCCTAAC ATTCGAACGC GATATCTCAC GTGAGGTTGC TAAACACTTT
GAAGATCCTT GTACCCTTTC TGGGGTTATT CTAGGGGCTG CCGGAGTAGG CAAGTCAACC
GCTGCCAGAC AGACATTACA ACTCCTCCGC CGCGCAGGCT TCAGGGCCTG GGAGCATGTG
AATGAACATA CTCTTAGCGT CGCCAACTGG AGAAAGATTG CCGGAAACCT CAAGGAAAGT
GAGCTCTTAG GTGTCCTGCT TGTAGACGAA GCACATTCTC ATTTGCACCA ACTTAATGAG
CTGATGGACC TGCTTGTCGC AGATGACAAT CCGCACCTGA AAATTCTTGC TGTTTCAACA
AAAAGCAATT GGATACCTCG AAGCAAAACA CCAAACTTCA ATCGTTGTGG GAAGGATTTT
TGGCTTTCAA AACTGTCTGT AGATGAGATC GACCGGCTCT TGAATTTAAT TGAACGACAG
CCGCGCATCC GTGAGCTTGT CGAGGAGTCT TTTAGCGGAT TCAACAAAGG CGAGCGCAGA
AGGCGGTTGG TGCATAGATG CGAAGCAGAC ATGTTTGTTT GTCTGAAGAA CATTTTTGCG
TCTGAGTCGT TCGACGATAT CATCCTTCGA GAATTTGCTG GCTTGGACGC CATCCCTCAG
GACATTTATA AACACGTCGC AGCTATGGAG ACACTCGGAG TACGCGTCCA CCGCCAACTT
GCCGTTCGCA TGCTTAACGT TGAAGCGAGC AATATCTCAA ATATTTTGAC TTCCTTGGAT
GATATCGTCG AAGAATATCC GGTTGATCGA CGCAAAGGAA TATATGGGTG GAAGTGCAGG
CATGGTGTCA TTTCAGAAAT TGTCACTAGA TACAAATTTG GGGATCTCGA ACAGATAATA
TCCCTCCTTG ATCACGTCAT AGATAACATT TCTCCGTCCT ACGATATTGA GGTTAGAACC
CTGCGCGAGC TTTGCAATCT CGACGGCGGA ATTTCACGAA TTCCCGAAAA GAAAGAGCAG
AATCGTCTAT TGAGGCGAAT GATATCCATG GCCCCAGGCG AGAGGGTTCC GCGACATCGG
CTGATCAGGA ACCTCATCGA CCAAGGTGAG TTTGAGAAAG CGGAAACGGA GATTCGCGTC
TTCAACTTTG ACTTCGGCTC AGACGGCCCC GTCCATCGCT ATAAAATTAA GTTGATGGTT
GCTCGAGCAG CTCGTGCGCC TGGTTTGCTC GATGAGGATA GAATTGTTAT TCTTGAGCAA
GCCCAAGAAC TTGCTTCGAC GGGTATCGCC AGGTTTCCGA ACAATAAGAG TATTCTGTCC
GCATATGCAG AGCTAGGTTT GGAGTATCTT CGTCGAACCG GGTCATACAG CTTTTTCGAT
GCCTCTATGG ATGAGTTGAA GGCTGCGGAA GGCCGACTCG GCGACCCAGA TATTACGGCT
ATGATCAGTC GCTTCGAACG TCGCGCAGCA GGCTCTGAGG CCGAACTCTG A
 
Protein sequence
MKIPRWRWKS IYTTNYDTLV EQSFDSAKKR CRVYSSNFDF HIDEEEYDCE LFKIHGTIEK 
DISTGHHSRI ILTVDDYDHT EEFREQLYDR MRGDLAGADL VIIGQSLSDP DMDTLVRRAV
KINEKALSPG QITLLIYSEN QSRALLQERK GLRVVFGGID EFFTRLDKRP PKIAPVMISK
DDEALIGTHI SNSIVEVSEV SNGAEADVSS MFNGWPATHR EVEAGLTFER DISREVAKHF
EDPCTLSGVI LGAAGVGKST AARQTLQLLR RAGFRAWEHV NEHTLSVANW RKIAGNLKES
ELLGVLLVDE AHSHLHQLNE LMDLLVADDN PHLKILAVST KSNWIPRSKT PNFNRCGKDF
WLSKLSVDEI DRLLNLIERQ PRIRELVEES FSGFNKGERR RRLVHRCEAD MFVCLKNIFA
SESFDDIILR EFAGLDAIPQ DIYKHVAAME TLGVRVHRQL AVRMLNVEAS NISNILTSLD
DIVEEYPVDR RKGIYGWKCR HGVISEIVTR YKFGDLEQII SLLDHVIDNI SPSYDIEVRT
LRELCNLDGG ISRIPEKKEQ NRLLRRMISM APGERVPRHR LIRNLIDQGE FEKAETEIRV
FNFDFGSDGP VHRYKIKLMV ARAARAPGLL DEDRIVILEQ AQELASTGIA RFPNNKSILS
AYAELGLEYL RRTGSYSFFD ASMDELKAAE GRLGDPDITA MISRFERRAA GSEAEL