Gene TM1040_1956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1956 
Symbol 
ID4077140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2060073 
End bp2061236 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content62% 
IMG OID638007271 
ProductHflK protein 
Protein accessionYP_613950 
Protein GI99081796 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0330] Membrane protease subunits, stomatin/prohibitin homologs 
TIGRFAM ID[TIGR01933] HflK protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.733622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.100531 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGGA ACAGTGGTGG CCCCTGGGGG GGCGGCGGTT CCTCCGGCGG CGGAGGCAAC 
CGGGGAAACA ACGGCGGCAA CAACGGAGGC GGCGGCGGTG GCCGGCGTCC TGATGATGGC
CAGATCCCCG AGATCGACGA GCTGGTGAAG AAAGGCCAAG AGCAGCTGCG CGTCCTGATG
GGCGGCCGTG GCGGCAATGG CGGCAACGGG CAGGGACCGC AGGGCGGTGG CTCTGGCGGC
AGCCCCTTGT TCACCAAAGG CGGGCTGATG CTCGGCGCGG TCGCTGCGGT TTTCCTCTGG
GGTTACAACA GCTTCTACAC CGTAAAGACC GAAGAGAAAT CCGTCGAGCT GTTCCTTGGC
GAGTTCTCCG CGGTGGGCAA TCCGGGTCTG AACTTCGCAC CCTGGCCTGT GGTCACCTAT
GAGGTAGTCC CGGTCAGCGT GGAACAGACC GAGAGCATTG GCGCAGGCGC GCGCGGCTCT
GATGCGGGTC TGATGCTGAC GGGCGATGAG AACATCATCG ACGTCGACTT CCAAGTGGTC
TGGAACATCA ACGAGCCCGA CAAATTCCTC TTTAACTTGC GCGATCCAAA GGCGACCATT
CAGGCCGTGT CTGAATCCGC GATGCGAGAG ATCATTGCAC AATCGCAGCT GGCGCCGATC
TTGAACCGTG ACCGCGGTCT CATTTCGCAG CGCCTCGAAG AGCTGATCCA GTCCACCCTC
GACAGCTATG ACGCAGGCGT GAACATCGTC CGGGTCAACT TTGACGGGGC CGATCCGCCT
GAACCGGTAA AAGACGCCTT CCGCGAAGTT CAGTCTGCCG GTCAGGAACG AGACCGTCTG
GAAAAGCAGG CTGACGCTTA TGCCAACCGC AAACTCGCGG CGGCGCGTGG TCAGGCCGCA
CAGACACTCG AAGAAGCAGA AGCCTACCGC GCGCAGGTCG TGAACCAGGC GCAGGGTGAG
GCCTCGCGCT TTACGGCCGT TCTGTCGGAA TACGAGAAGG CACCGGAAGT GACGCGCAAG
CGTCTCTATC TGGAAACCAT GGAGGACGTG CTGAGCCGCG TGGACAAGAT CATCCTTGAT
GACAACGCCG GGAGCGAAGG CGGACAGGGC ATCGTGCCGT ATCTGCCGCT CAATGAAATC
CGTCGTTCCG GAGGGAGCAA CTGA
 
Protein sequence
MAGNSGGPWG GGGSSGGGGN RGNNGGNNGG GGGGRRPDDG QIPEIDELVK KGQEQLRVLM 
GGRGGNGGNG QGPQGGGSGG SPLFTKGGLM LGAVAAVFLW GYNSFYTVKT EEKSVELFLG
EFSAVGNPGL NFAPWPVVTY EVVPVSVEQT ESIGAGARGS DAGLMLTGDE NIIDVDFQVV
WNINEPDKFL FNLRDPKATI QAVSESAMRE IIAQSQLAPI LNRDRGLISQ RLEELIQSTL
DSYDAGVNIV RVNFDGADPP EPVKDAFREV QSAGQERDRL EKQADAYANR KLAAARGQAA
QTLEEAEAYR AQVVNQAQGE ASRFTAVLSE YEKAPEVTRK RLYLETMEDV LSRVDKIILD
DNAGSEGGQG IVPYLPLNEI RRSGGSN