Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_3573 |
Symbol | |
ID | 8527460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013412 |
Strand | - |
Start bp | 11723 |
End bp | 13354 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | |
Product | Restriction endonuclease, type II, AlwI |
Protein accession | YP_003254599 |
Protein GI | 261420918 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000573232 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGA AAAACACCAG AAAGGTATGG TTTATTACGC GTCCGGAAAG AGACCCCCGC TTCCATCAGG AGGCATTGCT GGCTTTGCAG AAGGCGACGG ATGATTTCAG GCTGAAATGG GCGGGAAATA GAGAAGTCCA TAAACGGTAT GAAGAAGAGC TCGCCAATAT GGGGATCAAA AGAAACAACG TAAGCCATGA CGGCTCCGGA GGACGCACAT GGATGGCCAT GCTCAAAACC TTCTCCTATT GCTATGTAGA TGATGACGGA TATATTCGCC TTACAAAGGT CGGAGAAAAG CTCATCCAAG GAGAAAAGGT GTACGAAAAC ACAAGAAAAC AAGTCTTGAC GCTCCAATAT CCAAACGCCT ATTTTCTCGA ACCGGGATTC CGCCCAAAAT TCGATGAAGG GTTTCGGATC CGCCCTGTTC TGTTTCTGAT CAAATTGGCG AATGACGAAA GGCTGGACTT TTATGTAACA AAAGAGGAGA TCACTTATTT TGCCATGACA GCCCAAAAGG ACTCACAGCT GGATGAAATC GTGCATAAAA TATTGGCTTT CCGAAAAGCC GGCCCTCGCG AAAGAGAAGA AATGAAACAA GACATCGCCG CCAAGTTCGA CCATAGGGAA AGATCAGACA AAGGGGCGAG AGATTTTTAC GAAGCCCACT CCGACGTGGC CCACACTTTT ATGCTCATCA GCGATTACAC AGGATTGGTC GAGTACATTC GCGGCAAGGC CTTGAAAGGC GACTCTTCCA AGATCAACGA AATTAAGCAG GAAATCGCTG AAATCGAAAA ACGCTACCCA TTCAACACGC GATACATGAT ATCGCTTGAG CGCATGGCGG AAAACAGCGG CCTGGACGTC GACAGCTACA AGGCCAGCAG ATACGGCAAT ATAAAACCAG CTGCAAACTC CAGCAAGCTT CGGGCGAAAG CAGAGAGAAT CCTTGCTCAA TTTCCATCCA TAGAATCGAT GTCGAAAGAA GAAATCGCCG GGGCCCTCCA AAAATATTTA TCGCCAAGAG ACATTGAAAA GGTCATTCAT GAGATAGTGG AAAACAAAGA CGATTTTGAA GGGATCAACT CCGATTTTGT AGAGACCTAC TTGAATGAAA AGGACAACCT GGCGTTTGAA GACAAGACCG GCCAAATATT CAGCGCGCTG GGTTTTGACG TTGCAATGCG CCCTAAAGCC AAGAACGGGG AAAGAACAGA AATCGAAATC ATCGCCAGAT ACGGAGGAAG CAAATTCGGC ATCATTGACG CCAAAAACTA CGCAGGAAAG TTCCCGTTGT CCTCCTCCTT GGTTTCGCAT ATGGCGTCCG AATACATCCC CAATTACACG GGATATGAGG GCAAAGAGCT GACGTTTTTC GGCTATGTGA CCGCAAACGA CTTCAGCGGG GAGCGCAATC TAGAAAAGAT ATCGGACAAA GCCAAGCGAA TCACCGGAAA TCCCATCAGC GGATTTTTAG TCACAGCCAG AACATTGCTC GGCTTCCTTG ATTATTGCAT TGAGAACGAT GTGCCATTGG AAGACCGCGC CGAACTGTTT GTCAAGGCTG TCAAAAACAA AGGATACAAA TCGCTCGAGG CCTTGCTTCG GGAATTAAAA GAGACAATCT AG
|
Protein sequence | MNKKNTRKVW FITRPERDPR FHQEALLALQ KATDDFRLKW AGNREVHKRY EEELANMGIK RNNVSHDGSG GRTWMAMLKT FSYCYVDDDG YIRLTKVGEK LIQGEKVYEN TRKQVLTLQY PNAYFLEPGF RPKFDEGFRI RPVLFLIKLA NDERLDFYVT KEEITYFAMT AQKDSQLDEI VHKILAFRKA GPREREEMKQ DIAAKFDHRE RSDKGARDFY EAHSDVAHTF MLISDYTGLV EYIRGKALKG DSSKINEIKQ EIAEIEKRYP FNTRYMISLE RMAENSGLDV DSYKASRYGN IKPAANSSKL RAKAERILAQ FPSIESMSKE EIAGALQKYL SPRDIEKVIH EIVENKDDFE GINSDFVETY LNEKDNLAFE DKTGQIFSAL GFDVAMRPKA KNGERTEIEI IARYGGSKFG IIDAKNYAGK FPLSSSLVSH MASEYIPNYT GYEGKELTFF GYVTANDFSG ERNLEKISDK AKRITGNPIS GFLVTARTLL GFLDYCIEND VPLEDRAELF VKAVKNKGYK SLEALLRELK ETI
|
| |