Gene TM1040_0803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0803 
Symbol 
ID4076192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp850956 
End bp852005 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content59% 
IMG OID638006101 
Productphage putative head morphogenesis protein, SPP1 gp7 
Protein accessionYP_612798 
Protein GI99080644 
COG category 
COG ID 
TIGRFAM ID[TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.558654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000350456 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGTGA ATGACGACCT CGCGGACGAA CTGATCCGTC ATCAGGTCTA TCTACAGCGC 
TTCGGAAACG CTACGGCCCG CAAGGTTCTG GCGCTGCTCA AGCGGTCTGA CTCGCGGCTA
ATTGAGCGTC TACTGCGCGA CGACCTGTCT GGGCTTTCAC GGACCCGGCA AGAAGCGCTC
CTGCGGGAGC TGCGCAGGAT AATCGATAGC GCGTTTGAGG ATGCCACAGG GGCGCTACAA
ATCGACCTCA ACGGCCTCGC GGTCTACGAA GGGGAATATC AGCTCGAAAT GTTCCGACGG
GTGCTGCCGG TAAAGTTCGA GATGGTCGGG CCTGCTGCTG ATCAGATCTT GGCTGCTGTG
AACAGCCGGC CATTTCAAGG TAAGTTGCTG AAAGAGGTCT ATTCGGAGCT AAGCGCTAGT
TCGTTCCGCA AGGTCCGTGA CACCATCCGG GCCGGGTTCG TTGAGGGGCG CACCACAGAT
GAGATCGTGC GCGATCTGCG CGGCACCAAG GCGCAGGGTT TCAAGGACGG CGTGCTGGAC
ACCAACCGCC GGGCGACGGA AACGGTAGTA AGGACAGCGG TTAACCATAC CGCCAACACG
GCGCGTGAAT ACACATATGA GCGCAACGCC GACCTCGTGA AGGGGGTGCG CTGGAACAGC
ACGCTCGACG GCCGCACTTC GGCGGTCTGC AGGGCGCGGG ATGGCAAGGT TTACGATCCG
GGCAATGGGC CAAGACCGCC GGCACATTTC AATTGTCGCT CCAGCACATC GCCGGTTCTC
GCGTCTTGGC GCGATCTGGG CTTTGACATT GACGAACTAC CGCCATCCAC CCGCGCGAGC
ATGAACGGGC AGGTTCCGGC GGATCAGGAC TATGATACAT GGCTGAGAAA ACAGCCTCGG
GCTTTTCAGG TCGAGGTTCT CGGTGAAACT AAAGCAAAAC TGTTCCGGGC TGGTCTTAAG
ATGGATCGCT TCATTGACAG GAAAGGCCAA GAGCTTACCC TGACAGAACT GAAACGCCGG
GAGCGCGACC TTTGGGAAAA AGCCACCTAA
 
Protein sequence
MAVNDDLADE LIRHQVYLQR FGNATARKVL ALLKRSDSRL IERLLRDDLS GLSRTRQEAL 
LRELRRIIDS AFEDATGALQ IDLNGLAVYE GEYQLEMFRR VLPVKFEMVG PAADQILAAV
NSRPFQGKLL KEVYSELSAS SFRKVRDTIR AGFVEGRTTD EIVRDLRGTK AQGFKDGVLD
TNRRATETVV RTAVNHTANT AREYTYERNA DLVKGVRWNS TLDGRTSAVC RARDGKVYDP
GNGPRPPAHF NCRSSTSPVL ASWRDLGFDI DELPPSTRAS MNGQVPADQD YDTWLRKQPR
AFQVEVLGET KAKLFRAGLK MDRFIDRKGQ ELTLTELKRR ERDLWEKAT