Gene TM1040_3843 details


Gene Information

Locus tag: TM1040_3843
Symbol:
ID: 4074906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008042
Strand:
Start bp: 90406
End bp: 92100
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 49%
IMG OID: 638004500
Product: hypothetical protein
Protein accession: YP_611235
Protein GI: 99077976
COG category:
COG ID:
TIGRFAM ID:
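
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following minimal Python sketch shows the arithmetic. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    return gene_length_bp // 3 - 1


length = gene_length(90406, 92100)
print(length)                  # 1695
print(protein_length(length))  # 564
```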


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.101575
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAATG TTCTTCCGTT TGGAAAAAAA ACGCATTCCT CGCTTAGTGA GAACACTCAA 
GACGTATTAG ACCTGAGCAC ACTGACGGAT GGCGCGGCGG ATTTCTCCAC CGGAATGGTT
GACCGGTCAT CAGGGGGCCT CTCACACCTC CCTAACCAAA CTGCAGTCAC ACCCGATTGG
GCCAATCAAG AAATTGCTAG CTTCATGCGA GCACATCGGT TGCTTGCCAT TGCGGGTCTC
TCCTTGGAAA CAGAGCGCGG AGTGACCGAC GAAGGTGACC CTTGGTTTGT TTTCGTAAAT
CAAGATGGAG ACGTATTCGC GCATTTCGCG CGCATCGGCG AAGTCTACAT CCTCGACAGT
ATTATTCAGC AGGAAATTAC AAAAGCCAGA AGCTTGGATG AACTGATTTC AGCCTTTGCA
GAAACCACTA CACCGGTGGA CAGTGTAGAC CACGCTTCAA CATCTGCCAC AGTTGTTCCC
TTCATCGCCT TGCGAGAATC AAAAGTACGT CTACACCCAG GCGCAACCTT GGCCGCTCTT
ATCTGGACAA TCTATATCCA GTCAGGAGAA CTTGCGGTCC CATCATTTAG TGCTGCGCTG
GATGTGACTG CAACGTCAAA GGCCGATGAA GTGACCACGC TACCCTCTCA AACCCGGGCC
TCCCCCTCGG AAGAGTTGGC TACGAGTACA TCTGAAGATA ATTCAGCAGA TCTCAAAAGT
ACTGAAAAGG ATACACACAC TCAAACGCCC TCAAGCGCTC AAGTGGCAGC GACATTTACT
ACCGTGCAGA CGGTTGGAAT GGGATTGAGC GCTATAGCCA TCTCGAATGG CATGTACTTT
TGGGTTTCAG GTGAGACCTT GCTCGAACAA GCGACACTGG CATCTCAGCT AGTCGCCGAA
GTAATCAAAG ATATATCTGA TGAAGAAACT GAGCAGCTTG CCGATCTGTC GGAGCTAGAT
AGCGTATTAT CCACAGTTCG CGCAGCGATA GAGGACAGTT CAGAGGCAAT GGCTTTGGCT
CAATCGCCAA TACGAGATAT TCCTACTGTC GACTTGGCAT CCATGCCGGC TATCGTTGCT
AACGCTGTTG AAAATGGAAA AGTTGGGAAC GCAATCAAAG CTAAAGACGT TCAAGGCGAC
GTAGCTTTTT CTGAAGAGAT GGATGGTGAG CTCATCAAGT TCCGGGATAT TCAGGCGAAC
CGCCCGGAGG AAATAGAGAC ATCTCAGATT CGCTCTACAC CAGTAGACAT TGAGAAAAAA
TATATTTTAC AAGAGCTTGA CGAGGTTGAG CACTTCTCGC TTACGTCCGA CGCTTTTAGC
AATATAGCTA CCCAGCTTTC TGACCTACCA TTCTTCACGC AAATGCTCAG CGGCTCATTC
CAAGCGGTCG TTGCCAGTGA GGGTTCTCAA ACTGTGCCCG CGTCAAGGCC CGACACCCTC
AATGATGCCT CCGAGACGGC ACCACGCTTC TCGATCTTTA ACGATGATGC GCACAATTTT
ATTGTATTCT TGATGAGTAA AGGGGATGAG ACCAAACGGT CTGACTACGA TAACGAAGTT
GTACTCTTTG ACTTTGATGC AATCGATAGT CAAACAGACG CGATCTATGC TCGTAGCTGG
TCCTTCGAAG ATGGATCTGT GATTTCTGCT GTAGGTTTAA AAAGCGACTT TGCTGCTTTT
GATCTTGTGA TCTAA
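
The gene sequence above is displayed in space-separated blocks of 10 bases, 60 bases per line. A minimal Python sketch, assuming the sequence has been copied as plain text, strips that layout and computes a GC fraction that can be compared with the GC content field. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGAACAATG TTCTTCCGTT TGGAAAAAAA ACGCATTCCT CGCTTAGTGA GAACACTCAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(seq[:3])   # ATG
```

Passing the full 1695 bp sequence to `gc_fraction` gives a value to compare with the reported GC content.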
 
Protein sequence
MNNVLPFGKK THSSLSENTQ DVLDLSTLTD GAADFSTGMV DRSSGGLSHL PNQTAVTPDW 
ANQEIASFMR AHRLLAIAGL SLETERGVTD EGDPWFVFVN QDGDVFAHFA RIGEVYILDS
IIQQEITKAR SLDELISAFA ETTTPVDSVD HASTSATVVP FIALRESKVR LHPGATLAAL
IWTIYIQSGE LAVPSFSAAL DVTATSKADE VTTLPSQTRA SPSEELATST SEDNSADLKS
TEKDTHTQTP SSAQVAATFT TVQTVGMGLS AIAISNGMYF WVSGETLLEQ ATLASQLVAE
VIKDISDEET EQLADLSELD SVLSTVRAAI EDSSEAMALA QSPIRDIPTV DLASMPAIVA
NAVENGKVGN AIKAKDVQGD VAFSEEMDGE LIKFRDIQAN RPEEIETSQI RSTPVDIEKK
YILQELDEVE HFSLTSDAFS NIATQLSDLP FFTQMLSGSF QAVVASEGSQ TVPASRPDTL
NDASETAPRF SIFNDDAHNF IVFLMSKGDE TKRSDYDNEV VLFDFDAIDS QTDAIYARSW
SFEDGSVISA VGLKSDFAAF DLVI
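
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal, self-contained Python illustration, not the database's own pipeline. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons) and that translation stops at the first stop codon. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAACAATG TTCTTCCGTT TGGAAAAAAA"))  # MNNVLPFGKK
print(translate("GATCTTGTGA TCTAA"))                  # DLVI
```

The two outputs match the first ten and last four residues of the listed protein sequence.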