Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smal_2500 |
Symbol | |
ID | 6476989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stenotrophomonas maltophilia R551-3 |
Kingdom | Bacteria |
Replicon accession | NC_011071 |
Strand | - |
Start bp | 2806403 |
End bp | 2807653 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642731686 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_002028884 |
Protein GI | 194366274 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00454127 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAGA TGACCCACGG CCGCGTCCCG CGCGGCCTCG TTTCCGTGCG CGCCGAAGGC GCAAACCAGC CCGACGTGAA GGCGCTGGTG GAGTCGCTGA ACAAGGCATT CGCCGACTTC AAGGCCGAGC ACACCAAGCA GCTGGAAGAG ATCAAGAAGG GCAGCGCCGA TGCACTGCAG GCCTTGAAGG TCGACAACAT TAATGCCGAC ATCACCCGCC TGCAGGCCGC GGTCGACCAG GCCAACACCC AAATGGCCGC GTTCCAGATG GGCGGTGGTA GCGCCGGCAG CGGTGTCGCC GACGCCGAGT ACACCGAGTC GTTCCGCGCC CACTTCCGAA AGGGTGAAGT GCAGGCGGCC CTGAACAAGG GTGCTGCCGA TGAAGGTGGC TATCTGGCGC CGATCGAATG GGATCGTTCG ATCACCGATC GCCTGGTCAT CGTGTCGGAT ATGCGGCAGT TGGCCAACGT GCAGCCCTGC TCCGGCGCAG GCCTGACCAA GCTCTACAAC ACCGGCGGCA CTTCCTCGGG CTGGGTGGGC GAAGAGGATC CGCGCCCGGA GACCGCGACT GCGAAGCTGC GCCCGCTCAG CTTCGGCTGG GGTGAGATCT ACGCCAACCC GGCAGCGACC CAGCAGCTGC TGGACGATGC CGAGATTGAC CTGGAGGCGT GGCTGGCCGG CGAGGTCGAG CTGGAGTTCG CCAAGCAGGA GGGCGATGCG TTCTTCTCCG GCAATGGCGT CAACAAGCCG TTCGGCATCC TGACCTACGT GGACGGTGGC GCCAACGCGG GCAAGCACCC GTTTGGTGCG ATCAAGGTGG TGAACAGCGG GCTGGCGGCC GGCATCAACG GTGACAGCAT TCTGGACCTG GTCTATGACC TGCCGTCGGC ATTCACCGCG GGCGCCAAGT TCGCGCTGAA CCGCAAGACC CAGGGTGTGG TGCGCAAGCT GAAGGATGCC CAGGGCAACT ACCTGTGGCA GCCGTCGCTG GTGGCGGGTC AGCCGTCGAC CCTGGCCGGC TTTGCGGTGC AGGACGTGGC TGCGATCCCG GACGTGGCAG CAAACGCCAT TGCCGCGCTG TTCGGCGACT TCAAGCAGAC CTACACCGTG TACGACCGCA AGGGCGTACG CGTGCTGCGC GACCCGTACA CCAACAAGCC CTACGTGATG TTCTACACCA CCAAGCGCGT GGGTGGCGGT GTGCACAACC CGGAGCCGAT GCGCGCCCTC AAGATCGCGG CTTCGGCCTG A
|
Protein sequence | MTKMTHGRVP RGLVSVRAEG ANQPDVKALV ESLNKAFADF KAEHTKQLEE IKKGSADALQ ALKVDNINAD ITRLQAAVDQ ANTQMAAFQM GGGSAGSGVA DAEYTESFRA HFRKGEVQAA LNKGAADEGG YLAPIEWDRS ITDRLVIVSD MRQLANVQPC SGAGLTKLYN TGGTSSGWVG EEDPRPETAT AKLRPLSFGW GEIYANPAAT QQLLDDAEID LEAWLAGEVE LEFAKQEGDA FFSGNGVNKP FGILTYVDGG ANAGKHPFGA IKVVNSGLAA GINGDSILDL VYDLPSAFTA GAKFALNRKT QGVVRKLKDA QGNYLWQPSL VAGQPSTLAG FAVQDVAAIP DVAANAIAAL FGDFKQTYTV YDRKGVRVLR DPYTNKPYVM FYTTKRVGGG VHNPEPMRAL KIAASA
|
| |