Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_4922 |
Symbol | |
ID | 5602451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009829 |
Strand | - |
Start bp | 12199 |
End bp | 14049 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640930785 |
Product | terminase GpA |
Protein accession | YP_001471690 |
Protein GI | 157362831 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 65 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.0380287 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTACAA CCGTTTGGAG TGAGTCTTTT TTCCGCTCGC TGCGGCCGCG TTCCCGGTTA ACAGTTTCCG CCTGGGCGGA TAAATACCGC GAGGTAGCCC CCGGAACGTC GCCAGAACCG GGTAAGTGGC GTACATCGCG GGTGCCGTAT TTGCGGGAAC CGATGGATAT CATCGGTGAT GCTGAAACGG AAACCGTCGT TTTAATGTTC AGTTCGCAAG TGGCAAAGTC AGAACTGCAG CTGAATGTCA TGGGGTACTT CACCGACCAA GAACCCTCCC CGCAGCTTAT GGTTTATCCC ACGATTGACG CGGCGGAAGC ATTTTCAAAA GAGCGTATCG ACCCGACCTT TAAATACTCG CCCGGGCTAA AAAACAAACT CAGCGAAGGC CGCGAGGGAC GCGGTATTGC AAAAAAGACC AGTACCACTA TCCGCATGAA ACACTATGCC GGTGGTTATA TCGCGCTGGT GGGAGCCAAT GCGCCAGCCG GTCTGGCATC GCGTCCGATT CGTGTACTGC TCGCTGACGA AATCGACCGC TATACAGCGA CGGTGGAAGG GGATCCCCTG AAGCTGGCCA TTCAGCGTAC AACCAACTTT CACAACCGGA AAATCGTTCT TGCTTCAACG CCGGTGAAGA AAGAGACGAG CAAAATTTTC GAGTGGTTCA AAAAGTCCGA TCAGCGTCGG TATTACATTC CATGTCCGCA TTGTGGTTGC CTTCAGGTGC TCCGCTGGTC ACAGGTCAAA TGGGAAAAGA ACGATCTCGG TGAGGCATTG CCCGAAACCG CCCGCTATGA GTGCAAAGAA TGCGCCGGTC ACATATTGGG TTCAGGTAAA CCCGACCCCG AACTGATAGC CAAAGGGGTT TGGAAAGCCG ATAAACCGCA CGTGAAAAAA ATTGTCGGTT TTCACATCAA CAGCCTGTAT TCGCCGTGGG TAGCGCTATC TAGCCTTGTC GAAGAGTTTG CCGCAGCAAC AAAAAACCGA GACAAAACCG GCTTAATGGA GTTCATCAAC CTGAAGCTGG GCGAGCCTTG GGAAGAAGAT AAAAAAGGCG ATATTGATCA CGATTATCTT TTGCGTCGTC GCGAACGGTA CGCCGCTGAT TTGCCGATTG GTGTGTTGGT TCTAACTGCA GGCGTAGATA CGCAGGATAA ATATCTGGTG GTTGAAGTAA CCGGCTGGGG AAAAGGGAAA GAAAGCTGGG GCATTGAGTA CAAAATTTTC ATGGGTGATA CAAAACAACC GCAGGTGTGG AAAGAGCTTG ACGAGTATTT GCAGCGCAGT TGGTCATTTG AGGACGGTCG TCGCCTCTCG ATCGCCGCAA CATGCGTCGA CTCCGGTGGT CATGCCACAA CCGAGGTTTA TCAGTTTACT AAACCACGTG AATCACGACG CATTTTCTCC ATTCGTGGGC GCGGTGGTGT CGGGATCCCG TTCATTGGAA AGCCAAATAA CAATAACCGC GTTGGTGCGA TGTTATTCAA CCTCGGCGTT GATGATGGTA AAGGCACAAT CATGTCGCGC ATTAAACTGC ACGATGAAGG GCCAGGGTAT ATGCATTACC CGCTCGGTGA TAAAGGCTTT GATACTGAAT ATTTTAAAGG ATTGTTATCA GAGCGAAAAA TATACAAATA CGCTAAAGGC AAGACCCAAG AGGTATGGGA AAAAGTCTAC GAACGAAATG AACCGCTCGA CTGCCGCAAC TACAGCACCG CCGCACTAGA AATCCTCAAT CCCAATTTCG AATGGCTGGA ACAGCAAGTT CAGCTCGGCA ACGTTTATAT ACAACACCCG CAAGCACAGC GTCCTAAAGG TCGCCGCGTG ATGAGTAAGG GTGTGGGGTA A
|
Protein sequence | MGTTVWSESF FRSLRPRSRL TVSAWADKYR EVAPGTSPEP GKWRTSRVPY LREPMDIIGD AETETVVLMF SSQVAKSELQ LNVMGYFTDQ EPSPQLMVYP TIDAAEAFSK ERIDPTFKYS PGLKNKLSEG REGRGIAKKT STTIRMKHYA GGYIALVGAN APAGLASRPI RVLLADEIDR YTATVEGDPL KLAIQRTTNF HNRKIVLAST PVKKETSKIF EWFKKSDQRR YYIPCPHCGC LQVLRWSQVK WEKNDLGEAL PETARYECKE CAGHILGSGK PDPELIAKGV WKADKPHVKK IVGFHINSLY SPWVALSSLV EEFAAATKNR DKTGLMEFIN LKLGEPWEED KKGDIDHDYL LRRRERYAAD LPIGVLVLTA GVDTQDKYLV VEVTGWGKGK ESWGIEYKIF MGDTKQPQVW KELDEYLQRS WSFEDGRRLS IAATCVDSGG HATTEVYQFT KPRESRRIFS IRGRGGVGIP FIGKPNNNNR VGAMLFNLGV DDGKGTIMSR IKLHDEGPGY MHYPLGDKGF DTEYFKGLLS ERKIYKYAKG KTQEVWEKVY ERNEPLDCRN YSTAALEILN PNFEWLEQQV QLGNVYIQHP QAQRPKGRRV MSKGVG
|
| |