Gene TM1040_0801 details

Gene Information

Locus tag: TM1040_0801
Symbol:
ID: 4076073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 848343
End bp: 849593
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 61%
IMG OID: 638006099
Product: hypothetical protein
Protein accession: YP_612796
Protein GI: 99080642
COG category:
COG ID:
TIGRFAM ID: [TIGR01547] phage terminase, large subunit, PBSX family
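
The gene and protein lengths above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(848343, 849593)
print(length, protein_length_aa(length))  # 1251 416
```

Both values agree with the Gene Length and Protein Length fields of this record.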


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.887797
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000709779
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTCACACG ATCTGACGGT TAGCGCGCCG CAGGGTGTTT TCCTAAGCGG CCTCAACACC 
AAGTTTCGCG CCTACGTTGG CGGGTTCGGG TCTGGCAAGA CCTATGTGGG GTGCCTCGAC
CTCGGCTTGT TTGCAGGGCA GCACCCCAAG ACGGTTCAGG GATATTTCGC CCCGACGTAT
CGGGACATTC GAGACACCTT CTGGCCAACC GTAGACGAGG CCGCGCATTC GCTGGGGTTC
ACGACCAAGG TCAAGAGCGC CGACAAGGAG GTCGAGTTCT ACCGGGGCCG CAGCTACTAC
GGCACCACCA TTTGCCGATC GATGGATGAT CCGGGCGGCA TTGTGGGCTT CAAGATCGCT
CGCGCCCTGG TCGATGAGAT CGACATTCTC AGCAAGGACA AGGCGCAAGC CGCCTGGCGT
AAGATCATCG CCCGGATGCG CCTGGTTCTC CCCGGCGTGG TCAACGGCAT CGGCGTCACC
ACCACCCCCG AGGGGTTCCG GTTCGTCTAT GACAGCTTCA AGCGGGAGCC AAAGAGCAAC
TATTCGATGG TGCAGGCCAG CACCTACGAG AACGAGGCGT TCCTGCCGCC AGACTACATT
TCAACCCTGC TGGAGGACTA CCCCGAGGAG CTGATTAAGG CCTACCTCAT GGGGGAGTTC
GTCAACCTCA CGAGCGGCAC CGTCTATCGC AGTTATGACC GGTTGCGGCA TCGATCAACA
CAGAGCATCC AGCCGCGGGA GCCGCTGCAC ATTGGGCAGG ACTTCAATGT TGGCAACATG
GCCTCGGTGG TTTTCGTCCA GCGCGGCGAA GATTGGCACG CGGTCGATGA GCTGCAGGGG
CTGCAGGACA CGCCGCATCT GATCGAGGTT CTATGCGACC GATACGAGGG GCACCACCTC
ACGATCTACC CCGACGCCAG CGGTAGCAGC CGCAAGACTG TCAATGCCAG CACGTCGGAT
ATTGAGCTTC TGCGGAAAGC GGGTCACGCG ATCCGGGCGC CTAGCACCAA CCCGGCGGTG
AAAGACCGGA TCCTCGCAGT GAATACGGCC TTCGAGAATG GCCGCCTCTT TGTGAACGCT
CTCCGCTGCA AAGCCTACGC CGAAGCGCTT GAACAGCAGG CATATGACAA GAACGGCGAG
CCGGACAAAT CCGCCGGTCT CGACCACCAC CCAGACGCGG GCGGCTATTT CGTCCACCAG
AAAATGCCGG TCGTGAAACC GACCTTCACC CGGCAGGAGC TTCGCCTTTG A
 
Protein sequence
MSHDLTVSAP QGVFLSGLNT KFRAYVGGFG SGKTYVGCLD LGLFAGQHPK TVQGYFAPTY 
RDIRDTFWPT VDEAAHSLGF TTKVKSADKE VEFYRGRSYY GTTICRSMDD PGGIVGFKIA
RALVDEIDIL SKDKAQAAWR KIIARMRLVL PGVVNGIGVT TTPEGFRFVY DSFKREPKSN
YSMVQASTYE NEAFLPPDYI STLLEDYPEE LIKAYLMGEF VNLTSGTVYR SYDRLRHRST
QSIQPREPLH IGQDFNVGNM ASVVFVQRGE DWHAVDELQG LQDTPHLIEV LCDRYEGHHL
TIYPDASGSS RKTVNASTSD IELLRKAGHA IRAPSTNPAV KDRILAVNTA FENGRLFVNA
LRCKAYAEAL EQQAYDKNGE PDKSAGLDHH PDAGGYFVHQ KMPVVKPTFT RQELRL
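
The protein sequence can be recovered from the gene sequence with translation table 11. The sketch below is a minimal illustration, not the database's own method. It assumes the sequence is on the coding strand, that the amino-acid assignments of table 11 match the standard code, and that a leading ATG, GTG, or TTG is read as methionine (table 11 allows further start codons, omitted here for brevity). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
first_block = "GTGTCACACG ATCTGACGGT TAGCGCGCCG"
print(translate(first_block))  # MSHDLTVSAP
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.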