Gene Spro_2042 details

Gene Information

Locus tag: Spro_2042
Symbol:
ID: 5607168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 2236048
End bp: 2237973
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 53%
IMG OID: 640937580
Product: terminase GpA
Protein accession: YP_001478273
Protein GI: 157370284
COG category: [R] General function prediction only
COG ID: [COG5525] Bacteriophage tail assembly protein
TIGRFAM ID:
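
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2236048
end_bp = 2237973
protein_length_aa = 641

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not a residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)            # 1926
print(remainder)                 # 0
print(expected_protein_length)   # 641
```

Under these assumptions the computed values agree with the listed gene length (1926 bp) and protein length (641 aa).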


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.324943
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0895639
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATAT CGAACAAACG GATTGATCGG CTGCGTTATT GGGTTGCCGC CGGTCTGCGT 
TCGCTATTCC GCCCCGTTCC TATGACGGCG GTCGAATGGG CTAACGAATT TTATTACCTC
CCCAAAGAAT CCTCCTATCA AGAGGGGCGC TGGGAAACGA TGCCGTTTCA GGTTGCGATC
ATGAACGCAA TGGGGAGCGA TGACATCCGG GAGGTAAACC TGATTAAGTC GGCGCGCGTC
GGCTATTCAA AAATGTTGCT GGCCGTCGTG GCGTATTTTA TCCAGCACAA ACAGCGAAAC
GGGCTGTTGT GGCAGCCGAC GGATGGCGAC GCTGAAAATT TTATGAAGTC GCACGTCGAA
CCGACGATCC GCGACGTTCC CAGTCTCTTG GCAATGGCGC CCTGGTACGG AAAAAAACAC
CGCGATAACA CGCTTTCGAT GAAGCGTTTT TCGAACGGTC GGGGTTTCTG GTGCCTGGGA
GGTAAAGCGG CAAAAAACTA CCGTGAAAAA TCGGTCGATT ATGTTGGCTA TGACGAACTG
GCCGCCTTTG ATGAGGATGT GGAGAAAGAG GGTTCACCGA CGTTCCTGGG TGATAAGCGC
ATAGAAGGCT CAGTGTGGCC AAAGTCGATA CGCGGATCTA CACCGAAAAT CAGGGGCATA
TGCCAGATAG AACGCGCCGC CAGTGAATCC GGGCATTTGA TGCGTTTTCA TGTGAAATGC
CCACATTGCG ACGAGGAGCA GTTTTTAAAG TTCGGCGATC GTGAAACGCC ATACGGTTTT
AAGTGGGAAT CAGGGAAGCC GAAGACGGTT TTTTATCTCT GTGAGCATAA CGCCTGCGTG
ATACGCCAGC AGGAACTTAA TTTTAGTGGT GCGCGTTATA TCTGTGAAAA CACGGGCCTT
TACACGTCTG ATGGCCTCCG CTGGTTCGAA TCGACGGGGC AGGAGGTTGA TCCCCCTGAA
TCAGTATCCT TTCACATCTG GACCGCTTAC AGCTCGTTTA CTACCTGGGC GCAAATCGTT
AAGGACTTTA GAAAGACGAA GGGCGACCCG GGCAAGCTGA AAACGTTCAC CAACACGACG
CTAGGCGAAA CTTGGGCCGA GGAAGTGGGG GAGCGGCCGT TACCTGAAAC CCTGGTTGAA
CTGGCAGAAC ATTACCGGGC AGAGGTGCCC GATCGTGTGG TTTACCTCAC TGCCGGCATT
GACTCCCAGC TCGACCGTTA CGAAATGCGT GTTTGGGGCT GGGCGCCAGG TGAAGAGGCA
TTCCTGATCG ACCGCGTGAT TATCATGGGG CGGCACGATG AAGAGGAAAC GTTGCTGCGT
GTTGATGAGG CCATCAACAA ACAGTATCAG CTGGCCGACG GCACGATCAT GACCATTGGC
CGTGTTTGTT GGGACACCGG CGGTATCGAC CAGACGATAG TTTATAACCG CTCGAAAAAA
CTGGGCCTCT TTCGGGTAAT CCCGATCAAG GGCGCCAGCG TTTATGGGAA GCCAGTGGCC
AACATGCCGC GCAAGAAAAA CAGCCACGGC GTTTTTCTGA CAGAAATCGG CACTGACGTC
GCCAAAGAAG TCATTTACAG CCGCTACAAA CTGGAACGCT CCGCTGATGG CTCCCCCGTT
CCTGGGCTTA TCCACTACCC GAATAATCCG GCGGTTTTCG ACCTGACCGA AGCCGAGCAG
ATGACGGCAG AGGAACTCAT AGAAAAATAT GAGAAAGGGA AAATTAAATT GCTCTGGGAC
GCCAAAAAAC GCCGAAACGA GGCCCTCGAC TGTTTTGTTT ATGCCCTGGC GGCTTTGCGT
ATCAGCGTTT CGCGCTGGCA GCTGGATTTG GATGTGTTAC TGGCCAGCCG CCAACAATCA
CCGTCCGGCC AGCAGGCCAG AAATAATAAT GACTTGGCCG CCCTGGCGGC TCAATTGGGA
GGATAA
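
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any length or composition check. The sketch below shows one way to do that and to compute GC content; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here as sample input.
first_line = "ATGAGTATAT CGAACAAACG GATTGATCGG CTGCGTTATT GGGTTGCCGC CGGTCTGCGT"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_content(seq), 1))   # GC percentage of this 60-base sample
```

Passing the full sequence block through the same two helpers would allow a comparison with the listed gene length and GC content.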
 
Protein sequence
MSISNKRIDR LRYWVAAGLR SLFRPVPMTA VEWANEFYYL PKESSYQEGR WETMPFQVAI 
MNAMGSDDIR EVNLIKSARV GYSKMLLAVV AYFIQHKQRN GLLWQPTDGD AENFMKSHVE
PTIRDVPSLL AMAPWYGKKH RDNTLSMKRF SNGRGFWCLG GKAAKNYREK SVDYVGYDEL
AAFDEDVEKE GSPTFLGDKR IEGSVWPKSI RGSTPKIRGI CQIERAASES GHLMRFHVKC
PHCDEEQFLK FGDRETPYGF KWESGKPKTV FYLCEHNACV IRQQELNFSG ARYICENTGL
YTSDGLRWFE STGQEVDPPE SVSFHIWTAY SSFTTWAQIV KDFRKTKGDP GKLKTFTNTT
LGETWAEEVG ERPLPETLVE LAEHYRAEVP DRVVYLTAGI DSQLDRYEMR VWGWAPGEEA
FLIDRVIIMG RHDEEETLLR VDEAINKQYQ LADGTIMTIG RVCWDTGGID QTIVYNRSKK
LGLFRVIPIK GASVYGKPVA NMPRKKNSHG VFLTEIGTDV AKEVIYSRYK LERSADGSPV
PGLIHYPNNP AVFDLTEAEQ MTAEELIEKY EKGKIKLLWD AKKRRNEALD CFVYALAALR
ISVSRWQLDL DVLLASRQQS PSGQQARNNN DLAALAAQLG G