Gene Information |
Locus tag | Spro_2042 |
Symbol | |
ID | 5607168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 2236048 |
End bp | 2237973 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640937580 |
Product | terminase GpA |
Protein accession | YP_001478273 |
Protein GI | 157370284 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
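The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function and variable names are illustrative and not part of the record.

```python
# Values copied from the record above; the names are illustrative only.
START_BP = 2236048
END_BP = 2237973


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))  # 1926 641
```

Under these assumptions the computed values match the Gene Length (1926 bp) and Protein Length (641 aa) fields.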
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.324943 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0895639 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTATAT CGAACAAACG GATTGATCGG CTGCGTTATT GGGTTGCCGC CGGTCTGCGT TCGCTATTCC GCCCCGTTCC TATGACGGCG GTCGAATGGG CTAACGAATT TTATTACCTC CCCAAAGAAT CCTCCTATCA AGAGGGGCGC TGGGAAACGA TGCCGTTTCA GGTTGCGATC ATGAACGCAA TGGGGAGCGA TGACATCCGG GAGGTAAACC TGATTAAGTC GGCGCGCGTC GGCTATTCAA AAATGTTGCT GGCCGTCGTG GCGTATTTTA TCCAGCACAA ACAGCGAAAC GGGCTGTTGT GGCAGCCGAC GGATGGCGAC GCTGAAAATT TTATGAAGTC GCACGTCGAA CCGACGATCC GCGACGTTCC CAGTCTCTTG GCAATGGCGC CCTGGTACGG AAAAAAACAC CGCGATAACA CGCTTTCGAT GAAGCGTTTT TCGAACGGTC GGGGTTTCTG GTGCCTGGGA GGTAAAGCGG CAAAAAACTA CCGTGAAAAA TCGGTCGATT ATGTTGGCTA TGACGAACTG GCCGCCTTTG ATGAGGATGT GGAGAAAGAG GGTTCACCGA CGTTCCTGGG TGATAAGCGC ATAGAAGGCT CAGTGTGGCC AAAGTCGATA CGCGGATCTA CACCGAAAAT CAGGGGCATA TGCCAGATAG AACGCGCCGC CAGTGAATCC GGGCATTTGA TGCGTTTTCA TGTGAAATGC CCACATTGCG ACGAGGAGCA GTTTTTAAAG TTCGGCGATC GTGAAACGCC ATACGGTTTT AAGTGGGAAT CAGGGAAGCC GAAGACGGTT TTTTATCTCT GTGAGCATAA CGCCTGCGTG ATACGCCAGC AGGAACTTAA TTTTAGTGGT GCGCGTTATA TCTGTGAAAA CACGGGCCTT TACACGTCTG ATGGCCTCCG CTGGTTCGAA TCGACGGGGC AGGAGGTTGA TCCCCCTGAA TCAGTATCCT TTCACATCTG GACCGCTTAC AGCTCGTTTA CTACCTGGGC GCAAATCGTT AAGGACTTTA GAAAGACGAA GGGCGACCCG GGCAAGCTGA AAACGTTCAC CAACACGACG CTAGGCGAAA CTTGGGCCGA GGAAGTGGGG GAGCGGCCGT TACCTGAAAC CCTGGTTGAA CTGGCAGAAC ATTACCGGGC AGAGGTGCCC GATCGTGTGG TTTACCTCAC TGCCGGCATT GACTCCCAGC TCGACCGTTA CGAAATGCGT GTTTGGGGCT GGGCGCCAGG TGAAGAGGCA TTCCTGATCG ACCGCGTGAT TATCATGGGG CGGCACGATG AAGAGGAAAC GTTGCTGCGT GTTGATGAGG CCATCAACAA ACAGTATCAG CTGGCCGACG GCACGATCAT GACCATTGGC CGTGTTTGTT GGGACACCGG CGGTATCGAC CAGACGATAG TTTATAACCG CTCGAAAAAA CTGGGCCTCT TTCGGGTAAT CCCGATCAAG GGCGCCAGCG TTTATGGGAA GCCAGTGGCC AACATGCCGC GCAAGAAAAA CAGCCACGGC GTTTTTCTGA CAGAAATCGG CACTGACGTC GCCAAAGAAG TCATTTACAG CCGCTACAAA CTGGAACGCT CCGCTGATGG CTCCCCCGTT CCTGGGCTTA TCCACTACCC GAATAATCCG GCGGTTTTCG ACCTGACCGA AGCCGAGCAG ATGACGGCAG AGGAACTCAT AGAAAAATAT GAGAAAGGGA AAATTAAATT GCTCTGGGAC GCCAAAAAAC GCCGAAACGA GGCCCTCGAC TGTTTTGTTT ATGCCCTGGC GGCTTTGCGT 
ATCAGCGTTT CGCGCTGGCA GCTGGATTTG GATGTGTTAC TGGCCAGCCG CCAACAATCA CCGTCCGGCC AGCAGGCCAG AAATAATAAT GACTTGGCCG CCCTGGCGGC TCAATTGGGA GGATAA
Protein sequence | MSISNKRIDR LRYWVAAGLR SLFRPVPMTA VEWANEFYYL PKESSYQEGR WETMPFQVAI MNAMGSDDIR EVNLIKSARV GYSKMLLAVV AYFIQHKQRN GLLWQPTDGD AENFMKSHVE PTIRDVPSLL AMAPWYGKKH RDNTLSMKRF SNGRGFWCLG GKAAKNYREK SVDYVGYDEL AAFDEDVEKE GSPTFLGDKR IEGSVWPKSI RGSTPKIRGI CQIERAASES GHLMRFHVKC PHCDEEQFLK FGDRETPYGF KWESGKPKTV FYLCEHNACV IRQQELNFSG ARYICENTGL YTSDGLRWFE STGQEVDPPE SVSFHIWTAY SSFTTWAQIV KDFRKTKGDP GKLKTFTNTT LGETWAEEVG ERPLPETLVE LAEHYRAEVP DRVVYLTAGI DSQLDRYEMR VWGWAPGEEA FLIDRVIIMG RHDEEETLLR VDEAINKQYQ LADGTIMTIG RVCWDTGGID QTIVYNRSKK LGLFRVIPIK GASVYGKPVA NMPRKKNSHG VFLTEIGTDV AKEVIYSRYK LERSADGSPV PGLIHYPNNP AVFDLTEAEQ MTAEELIEKY EKGKIKLLWD AKKRRNEALD CFVYALAALR ISVSRWQLDL DVLLASRQQS PSGQQARNNN DLAALAAQLG G
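The relationship between the gene and protein sequences can be illustrated with a minimal translation sketch. It assumes the gene sequence is written 5' to 3' on the coding strand, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard table only in its permitted start codons). Only the first 30 bases of the gene are translated here. The helper name `translate` is hypothetical.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
# The amino acid assignments are the same for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGAGTATAT CGAACAAACG GATTGATCGG"))  # MSISNKRIDR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.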