Gene TM1040_3374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3374 
Symbol 
ID4075273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp387083 
End bp388558 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content58% 
IMG OID638004882 
Producthemolysin-type calcium-binding region 
Protein accessionYP_611608 
Protein GI99078350 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.337821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATTA AACTGTTGAA ACCATTTTCG AATGTATTGG GGTGGAATGC CAAGAAGGGC 
CTCGTGGAGC CGAGCGACTC CAGCGACCCC TCGCGCGGGG ACGCCGAAAC GCAGCGCGAT
CCGGTTGAAG AGACCGATCC TATTTCGCGT GGTGACACCC AGCCCGATAC CGAGACAAAT
GATGGGCGCG ACACAGGCGG GTCTGACACG CCGGATCGCG GGGACGATCG GGACCCAGAT
CGTGGCGGCA ATGATGGGCC TCCGTCGATG TCGGTCAGCG CCAATATTTT TACGCGCTTT
GATACCAACC TGGCTGAGCC CGGCTTTCTA ACGCCCGGCC TGGCCAATGG TTTGAACATA
ACGCCCGGCA ACATCTGGAT CGATTGGAAT GGCGAGAACG CATTCGAAGG CACGCATGGG
GTCGACAAGG CCTGGGGCCT TTGGGGTGAC GATGACATCT ACCTCTACGA TGGCAACGAT
GTCGCCTATG GAGGTGCGGG GAATGACCTG CTTCATGGTG GCCGCGGCAA TGATGCGCTC
TATGGCGGGA TTGGCAATGA CTTCATCATC GGCGATGTCG GTAGTGACAC CATCGACGGT
GGCAGTGGAA ATGACAGGCT TCAAGGTGGC CGTGGCAATG ACATGATCAA CGGCGGCGAC
GGCAATGACC GGATCCTCGG AGAAGAAGGT CGCGACGTCG TGCGTGCCGG AGACGGTGAC
GATTTCATCG AGGGCGGTGA CGACAATGAC CTTATGTGGG GCGATGAGGG CCGCGACACC
ATGTTGGGCG GCGATGGGGA TGACCAAGTT TTTGGCGGCA TAGGCAATGA CCTGCTGCAA
GGTGGGCTTG GCAACGATCA TATGGAAGGC GGTGGTGGAG CCGATGTGCT GGTTGGCGAT
CAGGGGAACG ACCTCATCAA CGGCGGCAGC GGCAACGACA TGATGACTGG CGACCTTGGC
AATGACGAGC TGAATGGCGG GACCGGCAAT GACACTATGG AGGGTGGCGC CGGCAACGAC
ATCATGCGCG GTGGCGAAGG CAGCGATTTC ATCTACGGTG ACGGTGGCAA CGACTACATT
CAGGGCGGTG GTGGCAATGA CCATATTTAT GGTGGGACCA ACTCTGCCGG TGACATCGGC
GACTATCTGT TCGGCGGCGA GGGGTATGAT CTGTTCTACT TCGACTTCGG GGACAGTGGT
GCCAACGGTG GTCCTCGCGA TGTAATTCAG GACTTTTCGG CTGGTGGCGG GGATCTCATG
GTCTTTCAAG GGTTTGGCGC GACGACCTGG CGTGGAGGCG ATGGTTTCTC GGGTGGAGTT
GGAGCCGAAG CCTACTTCGA GCAGACGGTG AGCGACGAGT TCGGTGCGGT CACGCATGTT
TACCTGCGAG ACGACATCGG ACATGAGGCA GATCTTTCAC TGACCTTGCT GGGTTACATT
GATCTCACCG AGAGCGACGT CTTTGTGATC GCCTAA
 
Protein sequence
MTIKLLKPFS NVLGWNAKKG LVEPSDSSDP SRGDAETQRD PVEETDPISR GDTQPDTETN 
DGRDTGGSDT PDRGDDRDPD RGGNDGPPSM SVSANIFTRF DTNLAEPGFL TPGLANGLNI
TPGNIWIDWN GENAFEGTHG VDKAWGLWGD DDIYLYDGND VAYGGAGNDL LHGGRGNDAL
YGGIGNDFII GDVGSDTIDG GSGNDRLQGG RGNDMINGGD GNDRILGEEG RDVVRAGDGD
DFIEGGDDND LMWGDEGRDT MLGGDGDDQV FGGIGNDLLQ GGLGNDHMEG GGGADVLVGD
QGNDLINGGS GNDMMTGDLG NDELNGGTGN DTMEGGAGND IMRGGEGSDF IYGDGGNDYI
QGGGGNDHIY GGTNSAGDIG DYLFGGEGYD LFYFDFGDSG ANGGPRDVIQ DFSAGGGDLM
VFQGFGATTW RGGDGFSGGV GAEAYFEQTV SDEFGAVTHV YLRDDIGHEA DLSLTLLGYI
DLTESDVFVI A