Gene TM1040_3494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3494 
Symbol 
ID4075173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp527798 
End bp531232 
Gene Length3435 bp 
Protein Length1144 aa 
Translation table11 
GC content64% 
IMG OID638005009 
Producthypothetical protein 
Protein accessionYP_611728 
Protein GI99078470 
COG category[S] Function unknown 
COG ID[COG4717] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.364107 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTCA AACGTCTGTC ACTGGATCGG TTTGGCCATT TCACCGATCA ACAGTTCGAT 
TTTGGCTCTG CCCACGATGG GCATGACTTT CATATCATCT ACGGGCGCAA CGAAGCCGGT
AAGACCACCA CGATGGAAGC CGTCCTGCGG CTGTTTTACG GCTTTCCAAC GCGCGAGGCC
TACGCGTTTC GCCATCCGCG CAACAATCTA CAGATCTCTG CGACGCTCGA TTTCAATGGT
GAGCTGCGCC AGTTCACCCG CCTGCCCACG CGCAGTGGCG CGCTGGTGGA TGAGAGCGGC
ACCCCACTGC CGGAGGCTGC GCTTTCGGCG CATCTCGCCG GGCTTTCGGA ACCGGATTAC
CGCCGTCTTC TGTGTCTGGA TGACGAGACG ATTGAACGCG GCGGCGAAGA GATCGCCAAT
GCTCAGGGCG ATATTGGCCG GCTGCTGTTC TCGGCCGCGG CGGGGGTTGC GGATCTGAGC
CAGGTGCTTG ATGGCGTGCG CAGCCGGGCC GATGAGATTT GGAAGAAACG CGCCCGCAAC
ACGCGGATGC GCGAGCTGAA ACGCGCGCTT GAGGAGCTCG ACAAGGAGAT CAAGGCCCGC
GATGTCTCTG CAAACGCGTG GAAATCCCTC AAGCGGGATC TCACCAAGGC GCAACAGGCT
GAAGAAGACG CGCGGACGCG TCGCGATGAT CTCAATACAA CCCGGGCCCG GACCGAGGCC
GAACGGCGCG CCGTGCCGCT TCTTGCAGAA CTTCAGGAGT TGGAACAGGC GCTTGCGCCC
TTTGCGGATT TTCCCGCACA GCTGGATTTC AACCCCGAGC GGCTCGTCGA GTTGCGCAGC
GATTTAGGCA GCGCCACGCA GAACATCGCG CGCCTCAGCG ACGAACTTCA GACCCTTGAG
GAGGAGCAAG CGGCGCTTGT ATTGGATCCC GCACTTGAGG CGCTTCCTGC GGCACTCGAG
GCGCTTGAGG ATCTGAGTGC CCGCCACCGC ACCGCTGCGC TTGATCTGGA TCGGCGCAAG
GCCGAGCAGC AGGAGGCGCT CGAGACAATG CGGCAGGCCG CCCGCGACCT TGGTCTCGTC
GCAGAGAACG CCGACCTTGC GGCCTTGGTT CTTTCCGGCG CTGATATCAC CGCGCTTCAG
GACGCAAGAG AGCAACTTCG CGCGGCCATC ACCCACGAAG GTCTAGAAGC GCGCGAGGTC
GAGGATCTCG AGGAGCGCCT GCGCGTAGCG CAGGATGCTG TGCACGATTG TGCGCCGCTC
GCTGCAGAGG CCGGGCAGTT GCAAGAGCTG CTCTTGAATT TTGATGCAGA GCGGCTGGCG
ACGGCCCATG CAGCGGCAGC GGAGGCGATC ACAGCCGCCC AATCCCGGGC CGCCCGCAGC
CTGGCAGCGC TGTCTGTGCG CCACGTGACT TTTGATACCC TGCCGCATTG CCCGAGCAGC
TTGCGTCAGG CGCAGGAGCA CGCGGATCGT GACGCAGCGC TGGTTCAGGA CCTGCGCCGG
GCGACTGAGA CGCGCAATCA GCATAAGGAG GACGCCAGCG CCCGCGCCGC GCAGGCAAAG
GCCCTTGAAC GCGCTGCAAA TGTGGTCTCC GACAGTGAGG CCGAGGCCTT GCGCACCAAA
CGCGACGCAC TCTGGGCCGT GCATGTAGAC AGTTTGGATG CCGAGAGCGC CGCCACGTTC
CACGAAGCGC TGAGCCGCCA CGACACCGCT GCTGACGCGC GACTGACCCA ATCGCGAGAA
CTGGCGCAGC TGCGAGAGAT TGCGCAATCA GAGGCTGCGT CGAAGGCCCG CGCCGCAGAG
GCCGCCACCC GGATCACTGC GCTAGAAGAA CAGCGCGATG CCATTGCGGC AGAGGTCAAC
GCCGCTGCCA CCCAGCTTGG CCTTGCGCCG CTTTCCCCGG CAGAGTGGCG CGATTGGGTT
GAGCGTCATG AAACCGCGCG CGCGGATGCC GAGGCGCTTC GGGACACCAA GGACAGCCAT
TCGGCCACCT TTGCGCGGGC GCAGGCGTTG ATGGACGCCC TTGCAGAACG TGTACCGTCT
CTTCCGGCTG ATCTTGATCC GGCCCTTGCA CAGGCGCGCC GGATGGCAGA GGCGGCGCGC
CAGACCTCCG AGGCGCGCGG CGCCGCAGAA AAGCTCTTGC GGCAGGTGGA GCGCGACCTG
AATCGGCGAC AGGATCGCCA CAACGCCGCA CAAGAGGCCA AGAAAAAGGC CGAGGACACA
TGGCGGGCCT TGGTGCAGGA ACTTCTGGCA GGGCAGGTCT CGCCAGAGTC GCTTATGGCC
TCTTTGGAGC CACTGCGGGT GCTACGCGAG CATGACAAGA CCCGCAGCGC CGCCGCACGT
CGGGTCCGGA TGATGGAGGC GGATCAGGCG CTTTTTGCTC AAGAGGTCAA CGCCTTGGCA
CAGCTGCACG GGGTCGGGGT CAGTTCGGAC CCGGGCCAGA CCTATGGCGA CCTCAAATCC
CTGGCCGAGA CGGCCCGCAT CGCGCGCGAC AAGGCCGCGC GTCTGGAGCG TGCAATTGCA
ACGGCCACGG CAGAACGTGC GGACAACACC GCGCGGGTTG AGGCCATAGA TCAGGAGCTC
CGGGCGATCG CTGCGAGCTT TCCCGAGCCC CCACCTGCAC AGGACATCGA CACGTTGCGC
CAATGCGCCG CGCAGGCCCA GAAGGTCATT GCCGACCGCG CGGCCCGCGA CCGCCTGCGC
CGTCAGATCC TCTCCGAGCT CGACCTGTCG GATCTGGAGT CGGCGCGCGC GCAGCTTGCC
GAGACCTCGG TTGCCACGCT CAACGCGCGC CTGGAAAGCA CCCTCTCCGA TCTGACCCAC
GCCGAAGACG ACCTCACGCA GGCGATCCAA CAGCGGGTCA ACGCAGAACA TGCGCTGGCA
GAGATCAACG GCGACCGCAC GGCAGCGGCG CTGGTCGAGC AAAAAGCAAC GCTGGAACTG
CAGCTCGAAG AGGCCGCGCT CGAACATCTG GAGCTGTCTT TGGGGCATCA CCTGGCCTCA
GACGCAATCA AGCGCTACCG CGACAGCCAC CGCAGCGGCA TGCTGACCGC CACGGAACAG
TGTTTTGCCG ATCTGACGCA GGGCGCCTAT CCCGCCCTCA GCACACAGAT CTACGGCGAC
AGCGAGGTCC TTTTGGCGGT CGACAGAACC GGTGCCAGCA AACGGGCCGA CGAGATGTCC
AAAGGCACCC GGTTCCAGCT CTATCTGGCG CTCCGGGCCG CAGCGCATGA ACAACTGGTG
GCGCAGGGCA CCATATTGCC GTTCTTCTGC GATGACATCT TTGAGACCTT TGACGAGACC
CGAACCAGTG CGGCCTGTCA GGTGATGGAA GCCATCGGCA CGCGAGGACA GGCGATCTAT
CTCACCCATC ATCGCCATGT GGTTGAAATC GCCCAAAGCG TCTGTGCCAC CCCGCCAATC
ATTCACGAGC TCTGA
 
Protein sequence
MRLKRLSLDR FGHFTDQQFD FGSAHDGHDF HIIYGRNEAG KTTTMEAVLR LFYGFPTREA 
YAFRHPRNNL QISATLDFNG ELRQFTRLPT RSGALVDESG TPLPEAALSA HLAGLSEPDY
RRLLCLDDET IERGGEEIAN AQGDIGRLLF SAAAGVADLS QVLDGVRSRA DEIWKKRARN
TRMRELKRAL EELDKEIKAR DVSANAWKSL KRDLTKAQQA EEDARTRRDD LNTTRARTEA
ERRAVPLLAE LQELEQALAP FADFPAQLDF NPERLVELRS DLGSATQNIA RLSDELQTLE
EEQAALVLDP ALEALPAALE ALEDLSARHR TAALDLDRRK AEQQEALETM RQAARDLGLV
AENADLAALV LSGADITALQ DAREQLRAAI THEGLEAREV EDLEERLRVA QDAVHDCAPL
AAEAGQLQEL LLNFDAERLA TAHAAAAEAI TAAQSRAARS LAALSVRHVT FDTLPHCPSS
LRQAQEHADR DAALVQDLRR ATETRNQHKE DASARAAQAK ALERAANVVS DSEAEALRTK
RDALWAVHVD SLDAESAATF HEALSRHDTA ADARLTQSRE LAQLREIAQS EAASKARAAE
AATRITALEE QRDAIAAEVN AAATQLGLAP LSPAEWRDWV ERHETARADA EALRDTKDSH
SATFARAQAL MDALAERVPS LPADLDPALA QARRMAEAAR QTSEARGAAE KLLRQVERDL
NRRQDRHNAA QEAKKKAEDT WRALVQELLA GQVSPESLMA SLEPLRVLRE HDKTRSAAAR
RVRMMEADQA LFAQEVNALA QLHGVGVSSD PGQTYGDLKS LAETARIARD KAARLERAIA
TATAERADNT ARVEAIDQEL RAIAASFPEP PPAQDIDTLR QCAAQAQKVI ADRAARDRLR
RQILSELDLS DLESARAQLA ETSVATLNAR LESTLSDLTH AEDDLTQAIQ QRVNAEHALA
EINGDRTAAA LVEQKATLEL QLEEAALEHL ELSLGHHLAS DAIKRYRDSH RSGMLTATEQ
CFADLTQGAY PALSTQIYGD SEVLLAVDRT GASKRADEMS KGTRFQLYLA LRAAAHEQLV
AQGTILPFFC DDIFETFDET RTSAACQVME AIGTRGQAIY LTHHRHVVEI AQSVCATPPI
IHEL