Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_0625 |
Symbol | |
ID | 8524448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 616201 |
End bp | 617442 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | |
Product | phage portal protein, HK97 family |
Protein accession | YP_003251785 |
Protein GI | 261418103 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGGTTTG TTAGAAGCGC GTTAAAAAGA AGGGTAAAAA ACGAGAGCCA GACGGTAGAT TTAAACAATC CTCTTTTATT GCAATGGCTC GGCATTGATC CGGATACACC AAAAGATCAA TTGGCTGAGG CAACATATTT CGCTTGTTTA AAAATCCTGT CCGAGAGCTT GGGAAAATTA CCTCTAAAGA TGTATCAACG CACCGATCGC GGCATCGTGA AAAGCGATAA AGAGGATGTC TACTACATCT TAAAACTCCG GCCGAACCCG TACATGACAA GCAGCGTCTT TTGGTCAACG GTGGAAATGA ACCGGAACCA CTACGGCAAC GCGTATGTAT GGTGCCGGTA CGACGGACCG GTACTGCAAG ATATGTGGAT AATGCCAAGC AAGCACGTCG TCATCGTTGT GGACGATCAA GGGATTTTAG GGAAAAAGAA CGCAATATGG TATCGATACA ACGACCCGTA TGACGGCAAG CTATATGTCT TTGGCAACGA TGAGGTGTTA CATTTTAAGA CGTCGGCGAC GTTTGACGGC ATCACCGGCA TGTCAGTTCG GGACATCTTG AAAAACACGG TGGACGGCGC GCTGGAAAGC CAAAAATTCA TGAACAACCT TTACAAAACA GGATTGACAG GTAAAGCGGT GCTCGAATAT ACAGGTGATT TAGACCCCTC TGCACGCGAT CGCCTTGTAA AAGGGTTCGA GCAGTTCGCA AACGGCTCGA AAAACGCCGG AAAGATCATC CCGGTGCCGT TGGGGATGAA GTTGGTGCCG TTAGACATTA AGCTGACCGA TAGCCAGTTT TTTGAGTTAA AGAAATACAC CGCCCTGCAA ATCGCGGCGG CGTTTGGAAT CAAACCAAAT CAAATCAACG ATTATGAGAA GTCAAGTTAT GCGTCAGCAG AAGCGCAGAA CTTGGCTTTT TATGTGGACA CGCTGCTCTA TATCCTCAAG CAATACGAGG AAGAAATCAC ATACAAGATT TTGAGTACCC AGATGATTAA TCAAGGGTAC TTTTTTAAGT TCAATGTCAA CGTCATTCTA CGAGCGGATA TTAAGACGCA GATTGAAAGT TTAGCAACAG CGGTTCAAAA TGCGATTTTG AAACCGAATG AGGCGCGTGA TTATATTGAC ATGCCTGCTG ATGATTACGG CGATGTGTTA ATGGCCAATG GAAACTACAT TCCGTTGAGT ATGCTTGGCG CGAATTATGG AGTGAAAGGA GGTGAAGGTT GA
|
Protein sequence | MGFVRSALKR RVKNESQTVD LNNPLLLQWL GIDPDTPKDQ LAEATYFACL KILSESLGKL PLKMYQRTDR GIVKSDKEDV YYILKLRPNP YMTSSVFWST VEMNRNHYGN AYVWCRYDGP VLQDMWIMPS KHVVIVVDDQ GILGKKNAIW YRYNDPYDGK LYVFGNDEVL HFKTSATFDG ITGMSVRDIL KNTVDGALES QKFMNNLYKT GLTGKAVLEY TGDLDPSARD RLVKGFEQFA NGSKNAGKII PVPLGMKLVP LDIKLTDSQF FELKKYTALQ IAAAFGIKPN QINDYEKSSY ASAEAQNLAF YVDTLLYILK QYEEEITYKI LSTQMINQGY FFKFNVNVIL RADIKTQIES LATAVQNAIL KPNEARDYID MPADDYGDVL MANGNYIPLS MLGANYGVKG GEG
|
| |