Gene TM1040_1061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1061 
Symbol 
ID4077201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1139733 
End bp1140929 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content62% 
IMG OID638006365 
Productphage major capsid protein, HK97 
Protein accessionYP_613056 
Protein GI99080902 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0887541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACC ATCCCTTCAC GGGCCACGCG CCCGAGGATG CGGCGACCCC GCCGCAGAAT 
GTGGCCACGG AAGTGAAACA GGCTGTTTCG CAATTCGTGC AGCATTTCAA GGGGTTCCAA
GACGACGTGA CCCAAAAACT CAAACAGACG GAAGAGCGTA TGACCATGTT GGATCGTAAA
ACCCAAACCG CGGCGCGGCC GCATCTCGCA GCGGCCGAGG TCGATGGCGC ACCGCATCAA
AAGGCGATGC AAGCCTATCT GCGCCAGGGC GACGAGGAGG GCTTTCGCGG CCTCGACCTG
GAGGGCAAGG CCATGTCCAC GGCGGTGAAT TCCGATGGCG GTTTCCTGGT CGATCCGCAG
ACGGCGGATG TGGTGAGATC GGTGCTGCAC TCCACTGCGT CGATCCGTGC GGTGGCCTCG
GTGGTCAATG TGGAGGCGAC CTCCTTTGAC GTGCTGATCG ACCATTCCGA CGTGGGCGCG
GGCTGGGCCA CCGAGACCGG CTCGGTTACA GAGACCGGCA CGCCGTCCAT TGACCGTATC
GTGATCCCGC TGCACGAGCT CTCGGCCTTG CCCAAGGCTT CGCAACGGCT GCTGGATGAC
AGTGCCTTTG ACATCGAGGG CTGGCTTGCG GGCCGGATCG CCGACAAATT CGCCCGTGCC
GAAGCGCAGA GTTTTATCTC GGGGGATGGC GTGGACAAAC CCACCGGCAT CCTGACCCAC
CCCACGGTGG ACAATGGCAG CTGGAGCTGG GGGAACATCG GCTATGTGGC CACCGGCAGC
GATGGCGGCA TAGGCTCGGC AGATGCGATC GTGGATCTGG TCTATGCGCT GGATGCGCGC
TACCGCGCTG GGGCGAGCTT TGTGATGAAC TCCAAAACGG CGGGTCTTAT CCGCAAACTC
AAGGACGCCG ATGGTCGTTT CCTGTGGTCC GACGGTCTTC AGGCGGGCGA ACCTGCGCGG
CTGATGGGCT ATCCGGTGCT GGTGGCCGAG GATATGCCGG ATGTGGCCTC GGATAGTCTC
TCGATTGCCT TTGGGGATTT TGCCGCGGGC TACACGATCG CCGAGCGTCC CGATCTGCGC
GTTCTGCGCG ATCCCTTCTC TGCAAAACCG CATGTGCTGT TTTATGCCTC CAAGCGCGTG
GGCGGCGACG TGAGCGATTT TGCCGCGATC AAGCTGATGA AATTCGGGCT GAGCTGA
 
Protein sequence
MTDHPFTGHA PEDAATPPQN VATEVKQAVS QFVQHFKGFQ DDVTQKLKQT EERMTMLDRK 
TQTAARPHLA AAEVDGAPHQ KAMQAYLRQG DEEGFRGLDL EGKAMSTAVN SDGGFLVDPQ
TADVVRSVLH STASIRAVAS VVNVEATSFD VLIDHSDVGA GWATETGSVT ETGTPSIDRI
VIPLHELSAL PKASQRLLDD SAFDIEGWLA GRIADKFARA EAQSFISGDG VDKPTGILTH
PTVDNGSWSW GNIGYVATGS DGGIGSADAI VDLVYALDAR YRAGASFVMN SKTAGLIRKL
KDADGRFLWS DGLQAGEPAR LMGYPVLVAE DMPDVASDSL SIAFGDFAAG YTIAERPDLR
VLRDPFSAKP HVLFYASKRV GGDVSDFAAI KLMKFGLS