Gene TM1040_3538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3538 
Symbol 
ID4075216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp575970 
End bp578069 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content56% 
IMG OID638005052 
Productprotein of unknown function DUF1524 RloF 
Protein accessionYP_611771 
Protein GI99078513 
COG category[S] Function unknown 
COG ID[COG1479] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00806563 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCCA CCGATGCCAA CCTCCTGAAC TTCATCAACA GCGCGCCGCA GTTCGTCATT 
CCGATCTATC AGCGGACCTA TTCCTGGACC GAGCGTGAGT GTCGGCAGTT GTGGGAGGAT
ATCCGGCGTG CGGGGCGCAG TGGTCATATC CCCGTGCATT TCCTTGGCTC GGTCGTTTAC
ATCGAAGAAA GCCTGTCCAA CACGTCCAGC CGCGCGCCGC TGCTTGTCAT CGACGGCCAG
CAGCGCCTGA CCTCGGTCAC GCTGCTCCTT TCTGCTCTGT CGCGCAAGGT GGCCGACGAG
GAGCCGGTCG ATGGCTTTTC GTTCAAGAAG ATCCGCAACC GGTACCTGAT GGACCCGGAT
GAAAGCGGCG AACGTGCTTT CAAGCTGTTA TTGTCCAAGA CGGACCGAAC GACGCTGAAC
GCGGTGGTGT CGGGTGATGA CCTGCCCGAG CCGCAATCGA TGCGTGTTGC CGAGAACTTC
AAGATGTTCT CACGTTGGCT CGACCAGGAC GACACGGCTG TGTCCGAGGT TTGCTCTGGC
CTTGCCAAGC TGATGATTGT GGACGTGGCG CTCAGCCGGG AACATGACAA CCCTCAGCTG
ATTTTCGAAA GCATGAACTC GACGGGCAAG GAGCTGTCGC AGGCCGATCT GATCCGCAAT
TTCGTCTTGA TGGGGTTGGA GCCGAAACTG CAAACACGGC TCTATGAGCA ATACTGGCGG
CGCATGGAAG AAGGTTTTGG TCAGGCAGCC TATGCCTCGC ACTTTGATGG CTTCATGCGG
AATTATCTGA CGGTCGTGAC AGGGTCGATC CCTAGGCTTG ATGATGTCTA CGATGCGTAC
AAGGCGTATT CCCAGGAACA AATTGGCGGT GGCCGTGAGG TGGAAGACCT TGTGCGGGAG
GTCTGGGAAT TCTCGCAGTA CTACGGGGCG CTCGTACTGG GGCAAGAGAA AGATAAGGCC
TTGGCGCTCG GCTTCAGGGA TCTGCGAGAG CTGAAAGTTG ATGTCGCCTA CCCATTCCTT
CTGGAGGTCT ACAAAGACTA TAAGACGGGC GCTTTGAGCG CTGACGATTT CCTTGAAGTG
GTCCGCTTGG TCGAGGCCTA TGTCTTCCGT CGTGCGATCT GTTCCGTGCC CACGAACTCC
CTGAACAAGA CCTTTGCGAC ATTTGGCCGA GAGCTGAAAA AAGATCGCTA CCTGGAAAGC
GTCAAGGCAC ACATGCTCAA CATGAAGTCG TATCGCCGGT TTCCGCGTGA TGAAGAGTTC
GTCGCGCGCA TTCAGGACCG CGACCTCTAC AACTTCCGCA GCCGCACTTA CTGGCTTCGG
AAGTTCGAGA ACTTCGGAAG AAAAGAGCGC GTCGAGGTCG ACGATTACAC GATTGAGCAC
ATCCTGCCGC AGAACGAAAA GCTGTCAACG GAATGGCAAG CCGCCCTTGG ATCGGACTGG
GCTGCCGTGC AGGAGAAATG GCTTCACACG TTGGGGAACC TGACACTTAC CGGGTATAAT
TCCGAATACA GCGACAAGAG TTTCGCTGAG AAGCGCGACA TGGCTGGCGG CTTCAAGGAA
AGCCCTTTGC GCGTCAACAA CGGCTTGGGT TCGTTGGATG CGTGGAACGA AGCTGCGATC
AAGTCGCGAG CCCAACGACT TGCGCAGCAG GCCGCTGAAG TTTGGGCCGC ACCTTCTCTG
AGCGCCGAAA TCCTGGCGAC CTATCGTACC CCGACTGAAA TGGCCAACGG TTACACCATC
GAAGATCATC CTCACCTTAC ATCGGGCAAG ATGAGAGACC TCTTCGAGCA GTTCCGTCGC
GAAGTGATGG CGTTGGACCC GAGTGTAAGT GAACAGTTCC TGAAGATTTA CGTTGCTTAC
AAGGCCGAAA CAAATTTCGC GGATGTCGTA CCACAGGCCA AGGCGCTCAG GATTTCTATC
AACATTGAAC CTTATGAACT GCACGATCCG CGTGGGATGG CGGTCGATGT CACTGATGTA
GGTCGATGGG GCAATGGGAA TACCGAGGTC AAGCTAACGG ACCACGAGGA CCTGCCATAT
GTCTTGGGCC TGGTGCGGCA AGCACTGGAC CGTCAACTCG GAGCGGATGA GCCAGCATGA
 
Protein sequence
MKATDANLLN FINSAPQFVI PIYQRTYSWT ERECRQLWED IRRAGRSGHI PVHFLGSVVY 
IEESLSNTSS RAPLLVIDGQ QRLTSVTLLL SALSRKVADE EPVDGFSFKK IRNRYLMDPD
ESGERAFKLL LSKTDRTTLN AVVSGDDLPE PQSMRVAENF KMFSRWLDQD DTAVSEVCSG
LAKLMIVDVA LSREHDNPQL IFESMNSTGK ELSQADLIRN FVLMGLEPKL QTRLYEQYWR
RMEEGFGQAA YASHFDGFMR NYLTVVTGSI PRLDDVYDAY KAYSQEQIGG GREVEDLVRE
VWEFSQYYGA LVLGQEKDKA LALGFRDLRE LKVDVAYPFL LEVYKDYKTG ALSADDFLEV
VRLVEAYVFR RAICSVPTNS LNKTFATFGR ELKKDRYLES VKAHMLNMKS YRRFPRDEEF
VARIQDRDLY NFRSRTYWLR KFENFGRKER VEVDDYTIEH ILPQNEKLST EWQAALGSDW
AAVQEKWLHT LGNLTLTGYN SEYSDKSFAE KRDMAGGFKE SPLRVNNGLG SLDAWNEAAI
KSRAQRLAQQ AAEVWAAPSL SAEILATYRT PTEMANGYTI EDHPHLTSGK MRDLFEQFRR
EVMALDPSVS EQFLKIYVAY KAETNFADVV PQAKALRISI NIEPYELHDP RGMAVDVTDV
GRWGNGNTEV KLTDHEDLPY VLGLVRQALD RQLGADEPA