Gene Strop_0547 details


Gene Information

Locus tag: Strop_0547
Symbol:
ID: 5056986
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 618800
End bp: 620524
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 61%
IMG OID: 640472818
Product: Phage terminase protein large subunit-like protein
Protein accession: YP_001157408
Protein GI: 145593111
COG category: [R] General function prediction only
COG ID: [COG4626] Phage terminase-like protein, large subunit
TIGRFAM ID:
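
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the database record.

```python
# Values copied from the gene record.
start_bp = 618800
end_bp = 620524
gene_length_bp = 1725
protein_length_aa = 574

# Assumption: coordinates are inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the listed gene length
print(span_bp % 3 == 0)                          # CDS is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches the listed protein length
```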


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCAGAT CCGCGATGAC GCGTGAGGAG ATCGACGCAC TCCAACCCAC ATATCACGGC 
ATGACGTGGC GCCACAATGG CCCGCTCTGT CACGCCGCTG GCCTCACCAA GTGCGGATTC
GGTTGTGACG ACTGGCTACT CCCTGAGCGC ACGTTGGGTT GGGACGTGAT TCGGTGGGTT
GGGGAATACC TCCTGGACGA GGAGGGGCAG CGCTTCAGGC TGACGGCAGA GCAGCGCCGC
TTCCTGCTCT GGTGGTTCGC GATAGATGAC GCTGGACGCT TCGCCCATCG GACGGGCGTT
CTCCAACGCC TAAAAGGGTG GGGCAAAGAC CCCGTTGGCG CGATTCTGTG CCTAGTCGAG
TTCGTGGGGC CGGCACGATT CGCGCGGTGG GCCACCGGGG AGGACGCCAA GCGCAACCCC
CGGTTGCGCC CGGGGCGGGA CGCAATCGGG CGGCAGATGC CTGCCGCATG GGTCCAGATT
GCCGCCGTTA GCCGCGAGCA AACGAAGAAC ACCATGACGC TGCTGCCGGT GATGATGAGC
AGGCGGCTAA TCCAGGACTA CGGCGTTAAG CCAGGTAATG AGCTGATTCG GGCCGACCGG
GGTCGACGGA GAATTGAGGC GGTGACCTCA AACCCTCGCA CACTCGAAGG CGGCCGAAGC
ACCTTTGTGC TGCTAAACGA GACGCATCAC TGGATCAAGG GCAACAACGG TCACGCCATG
TACGAGACCA TTGATGGAAA CACCACGAAA AAGGATTCAC GGTATCTAGC CATCACGAAT
GCTTATTTGT CTGGTGAGGA TTCTGTTGCC GAGCGGATGC GCCTAGCGTA CGAGGATATC
TGTGATGGCC TGGCGCCGGA CGTTGGCCTC TACTACGACT CAGTAGAAGC GGACGCGCAG
ACACCGCTGA CAATTGACGG CCTGGAGGTC ACGCTCCCGA AGATCCGTGG TGATGCGGTC
TGGCTACGGG TCGATACGAT CATCAAGTCG ATTCAAAACA AGACGCTGTC GCCCAGTCGG
TCGCGGCGAA TGTGGCTGAA CCAAATCGAT GCCGTCGAAG ACGCCGTTTA CAGGATCGAG
GACCTGAAGG CTATCGAGCG CGCTGACGCC GAGCTGAAGG TGGGGGACGA AGTTGTCCTT
GGCTTCGATG GCGGTAAGAC GGATGACTCG ACCGCTCTCG TAGCCATTCG GCTTTCGGAC
GCGTGCGCCT TTTTGCTCGC TGTCTGGGAG CGGCCCGCGC GGTGGCCCGA AGACGAGCCC
TGGATGGTCC CCCACGAGCG CGTTGACTCG GAGGTGCACG ACACATTCCG CCTCTACAAG
GTCAAGGCAT TCTACGCTGA TGTCAGCTAC TGGGAGAGCT ACATCTCCCT GTGGAATAAA
GCTTATGGGC AAGGGCTGAC TCGAAAAGCC AGTCCGGATA GTCCGATCGG CTGGGACATG
CGGTCCCAGA AGCGCGCCAC GTTGGCGCAC GAGCGGCTAA TGGACGCTAT CGCGAGGCAG
AACATCCATT TCGACGGCGA TGCGACGTTG CGCCGGCATG CCGGCAATGC GCGACGGCGA
ACGAACAGCC ATGGCGTGAG CTTCGGCAAG GAGGGCGCGA AGTCGCAGCG CAAAGTTGAC
GCTTATGCGG CCTGGCTGCT GGCCCATGAG GCGATGTGCG ATCTCCGCAA CATGACCACC
AAGCAGGAAG AACGCTCGCG CTCTACCGAA ATGTGGGCCT ACTAG
 
Protein sequence
MVRSAMTREE IDALQPTYHG MTWRHNGPLC HAAGLTKCGF GCDDWLLPER TLGWDVIRWV 
GEYLLDEEGQ RFRLTAEQRR FLLWWFAIDD AGRFAHRTGV LQRLKGWGKD PVGAILCLVE
FVGPARFARW ATGEDAKRNP RLRPGRDAIG RQMPAAWVQI AAVSREQTKN TMTLLPVMMS
RRLIQDYGVK PGNELIRADR GRRRIEAVTS NPRTLEGGRS TFVLLNETHH WIKGNNGHAM
YETIDGNTTK KDSRYLAITN AYLSGEDSVA ERMRLAYEDI CDGLAPDVGL YYDSVEADAQ
TPLTIDGLEV TLPKIRGDAV WLRVDTIIKS IQNKTLSPSR SRRMWLNQID AVEDAVYRIE
DLKAIERADA ELKVGDEVVL GFDGGKTDDS TALVAIRLSD ACAFLLAVWE RPARWPEDEP
WMVPHERVDS EVHDTFRLYK VKAFYADVSY WESYISLWNK AYGQGLTRKA SPDSPIGWDM
RSQKRATLAH ERLMDAIARQ NIHFDGDATL RRHAGNARRR TNSHGVSFGK EGAKSQRKVD
AYAAWLLAHE AMCDLRNMTT KQEERSRSTE MWAY
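
As a rough consistency check between the two sequences above, the following sketch translates the first line of the gene sequence and compares it with the start of the protein sequence. It assumes that NCBI translation table 11 assigns the same amino acids to codons as the standard table (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tool.

```python
# Codon table in TCAG order. Assumption: codon-to-amino-acid assignments
# for translation table 11 are the same as in the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and first two blocks of the protein sequence.
first_gene_line = "ATGGTCAGAT CCGCGATGAC GCGTGAGGAG ATCGACGCAC TCCAACCCAC ATATCACGGC"
first_protein_blocks = "MVRSAMTREE IDALQPTYHG"

print(translate(first_gene_line) == clean(first_protein_blocks))  # True
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the listed protein length and GC content to be checked against the record.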