Gene GYMC61_3573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_3573 
Symbol 
ID8527460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013412 
Strand
Start bp11723 
End bp13354 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content47% 
IMG OID 
ProductRestriction endonuclease, type II, AlwI 
Protein accessionYP_003254599 
Protein GI261420918 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000573232 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGA AAAACACCAG AAAGGTATGG TTTATTACGC GTCCGGAAAG AGACCCCCGC 
TTCCATCAGG AGGCATTGCT GGCTTTGCAG AAGGCGACGG ATGATTTCAG GCTGAAATGG
GCGGGAAATA GAGAAGTCCA TAAACGGTAT GAAGAAGAGC TCGCCAATAT GGGGATCAAA
AGAAACAACG TAAGCCATGA CGGCTCCGGA GGACGCACAT GGATGGCCAT GCTCAAAACC
TTCTCCTATT GCTATGTAGA TGATGACGGA TATATTCGCC TTACAAAGGT CGGAGAAAAG
CTCATCCAAG GAGAAAAGGT GTACGAAAAC ACAAGAAAAC AAGTCTTGAC GCTCCAATAT
CCAAACGCCT ATTTTCTCGA ACCGGGATTC CGCCCAAAAT TCGATGAAGG GTTTCGGATC
CGCCCTGTTC TGTTTCTGAT CAAATTGGCG AATGACGAAA GGCTGGACTT TTATGTAACA
AAAGAGGAGA TCACTTATTT TGCCATGACA GCCCAAAAGG ACTCACAGCT GGATGAAATC
GTGCATAAAA TATTGGCTTT CCGAAAAGCC GGCCCTCGCG AAAGAGAAGA AATGAAACAA
GACATCGCCG CCAAGTTCGA CCATAGGGAA AGATCAGACA AAGGGGCGAG AGATTTTTAC
GAAGCCCACT CCGACGTGGC CCACACTTTT ATGCTCATCA GCGATTACAC AGGATTGGTC
GAGTACATTC GCGGCAAGGC CTTGAAAGGC GACTCTTCCA AGATCAACGA AATTAAGCAG
GAAATCGCTG AAATCGAAAA ACGCTACCCA TTCAACACGC GATACATGAT ATCGCTTGAG
CGCATGGCGG AAAACAGCGG CCTGGACGTC GACAGCTACA AGGCCAGCAG ATACGGCAAT
ATAAAACCAG CTGCAAACTC CAGCAAGCTT CGGGCGAAAG CAGAGAGAAT CCTTGCTCAA
TTTCCATCCA TAGAATCGAT GTCGAAAGAA GAAATCGCCG GGGCCCTCCA AAAATATTTA
TCGCCAAGAG ACATTGAAAA GGTCATTCAT GAGATAGTGG AAAACAAAGA CGATTTTGAA
GGGATCAACT CCGATTTTGT AGAGACCTAC TTGAATGAAA AGGACAACCT GGCGTTTGAA
GACAAGACCG GCCAAATATT CAGCGCGCTG GGTTTTGACG TTGCAATGCG CCCTAAAGCC
AAGAACGGGG AAAGAACAGA AATCGAAATC ATCGCCAGAT ACGGAGGAAG CAAATTCGGC
ATCATTGACG CCAAAAACTA CGCAGGAAAG TTCCCGTTGT CCTCCTCCTT GGTTTCGCAT
ATGGCGTCCG AATACATCCC CAATTACACG GGATATGAGG GCAAAGAGCT GACGTTTTTC
GGCTATGTGA CCGCAAACGA CTTCAGCGGG GAGCGCAATC TAGAAAAGAT ATCGGACAAA
GCCAAGCGAA TCACCGGAAA TCCCATCAGC GGATTTTTAG TCACAGCCAG AACATTGCTC
GGCTTCCTTG ATTATTGCAT TGAGAACGAT GTGCCATTGG AAGACCGCGC CGAACTGTTT
GTCAAGGCTG TCAAAAACAA AGGATACAAA TCGCTCGAGG CCTTGCTTCG GGAATTAAAA
GAGACAATCT AG
 
Protein sequence
MNKKNTRKVW FITRPERDPR FHQEALLALQ KATDDFRLKW AGNREVHKRY EEELANMGIK 
RNNVSHDGSG GRTWMAMLKT FSYCYVDDDG YIRLTKVGEK LIQGEKVYEN TRKQVLTLQY
PNAYFLEPGF RPKFDEGFRI RPVLFLIKLA NDERLDFYVT KEEITYFAMT AQKDSQLDEI
VHKILAFRKA GPREREEMKQ DIAAKFDHRE RSDKGARDFY EAHSDVAHTF MLISDYTGLV
EYIRGKALKG DSSKINEIKQ EIAEIEKRYP FNTRYMISLE RMAENSGLDV DSYKASRYGN
IKPAANSSKL RAKAERILAQ FPSIESMSKE EIAGALQKYL SPRDIEKVIH EIVENKDDFE
GINSDFVETY LNEKDNLAFE DKTGQIFSAL GFDVAMRPKA KNGERTEIEI IARYGGSKFG
IIDAKNYAGK FPLSSSLVSH MASEYIPNYT GYEGKELTFF GYVTANDFSG ERNLEKISDK
AKRITGNPIS GFLVTARTLL GFLDYCIEND VPLEDRAELF VKAVKNKGYK SLEALLRELK
ETI