Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0803 |
Symbol | |
ID | 4076192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 850956 |
End bp | 852005 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638006101 |
Product | phage putative head morphogenesis protein, SPP1 gp7 |
Protein accession | YP_612798 |
Protein GI | 99080644 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.558654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000350456 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGGTGA ATGACGACCT CGCGGACGAA CTGATCCGTC ATCAGGTCTA TCTACAGCGC TTCGGAAACG CTACGGCCCG CAAGGTTCTG GCGCTGCTCA AGCGGTCTGA CTCGCGGCTA ATTGAGCGTC TACTGCGCGA CGACCTGTCT GGGCTTTCAC GGACCCGGCA AGAAGCGCTC CTGCGGGAGC TGCGCAGGAT AATCGATAGC GCGTTTGAGG ATGCCACAGG GGCGCTACAA ATCGACCTCA ACGGCCTCGC GGTCTACGAA GGGGAATATC AGCTCGAAAT GTTCCGACGG GTGCTGCCGG TAAAGTTCGA GATGGTCGGG CCTGCTGCTG ATCAGATCTT GGCTGCTGTG AACAGCCGGC CATTTCAAGG TAAGTTGCTG AAAGAGGTCT ATTCGGAGCT AAGCGCTAGT TCGTTCCGCA AGGTCCGTGA CACCATCCGG GCCGGGTTCG TTGAGGGGCG CACCACAGAT GAGATCGTGC GCGATCTGCG CGGCACCAAG GCGCAGGGTT TCAAGGACGG CGTGCTGGAC ACCAACCGCC GGGCGACGGA AACGGTAGTA AGGACAGCGG TTAACCATAC CGCCAACACG GCGCGTGAAT ACACATATGA GCGCAACGCC GACCTCGTGA AGGGGGTGCG CTGGAACAGC ACGCTCGACG GCCGCACTTC GGCGGTCTGC AGGGCGCGGG ATGGCAAGGT TTACGATCCG GGCAATGGGC CAAGACCGCC GGCACATTTC AATTGTCGCT CCAGCACATC GCCGGTTCTC GCGTCTTGGC GCGATCTGGG CTTTGACATT GACGAACTAC CGCCATCCAC CCGCGCGAGC ATGAACGGGC AGGTTCCGGC GGATCAGGAC TATGATACAT GGCTGAGAAA ACAGCCTCGG GCTTTTCAGG TCGAGGTTCT CGGTGAAACT AAAGCAAAAC TGTTCCGGGC TGGTCTTAAG ATGGATCGCT TCATTGACAG GAAAGGCCAA GAGCTTACCC TGACAGAACT GAAACGCCGG GAGCGCGACC TTTGGGAAAA AGCCACCTAA
|
Protein sequence | MAVNDDLADE LIRHQVYLQR FGNATARKVL ALLKRSDSRL IERLLRDDLS GLSRTRQEAL LRELRRIIDS AFEDATGALQ IDLNGLAVYE GEYQLEMFRR VLPVKFEMVG PAADQILAAV NSRPFQGKLL KEVYSELSAS SFRKVRDTIR AGFVEGRTTD EIVRDLRGTK AQGFKDGVLD TNRRATETVV RTAVNHTANT AREYTYERNA DLVKGVRWNS TLDGRTSAVC RARDGKVYDP GNGPRPPAHF NCRSSTSPVL ASWRDLGFDI DELPPSTRAS MNGQVPADQD YDTWLRKQPR AFQVEVLGET KAKLFRAGLK MDRFIDRKGQ ELTLTELKRR ERDLWEKAT
|
| |