Gene GYMC61_0202 details


Gene Information

Locus tag: GYMC61_0202
Symbol:
ID: 8524008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 210168
End bp: 211355
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: transposase IS4 family protein
Protein accession: YP_003251383
Protein GI: 261417701
COG category:
COG ID:
TIGRFAM ID:
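
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(210168, 211355)
print(length, protein_length(length))  # 1188 395
```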


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAGAT TAGCACATCA CCAAGGAATC CACAAGTTTT TCTTCACTTT GGGGTTGACG 
CTGCAGCTTT CCAAACCGGT CATCAAGCAT CTCATTCATA TCGTCGATGC CTTGACCACC
AAGGGATTCT CGGGAACATT GACCGATATT CATCACTGGA GCTTTCATCC GAATCATCGA
ACGACGCTCA GTCACTTTTT CACGAAAAGC CCTTGGGATG AGGAAAGGCT GCTTGGGAAG
CTTCAAGAGT GGATCCTCCG CCGAATCGAG CGCCTGGTCG AGCGAAAGAA TCAGCCTCTT
TTTGTCTCGA TTGATGATAC GATTTGCCAA AAAACGAAGC CTTCGTCACG GGCAACGCAC
GCCATTCAAG GGTGCGACTG GCACTACTCG CATAAAGATC ATCAATCGGT TTGGGGGCAT
TCGCTCGTTT GGCTGATGGT GCACACCTTG ACACAAGCGT TTCCGTTTGC GTTCCGCCTG
TATGACAAGA AAGCGGGAAA AAGCAAGATC GACCTGGCCA TCGAGATGCT TTCTTCGCTC
AAAGTGAAGC GGGCTCAGCC GGTGTATGTG CTCATGGATT CGTGGTATCC GTCCAAAAAG
CTCATTGAAG CCTGCTTGAA ACAGGGATTC CATGTCATCG CCATGCTCAA GACGAACCGG
ATTCTCTACC CGAAAGGCAT CGCCATCCAA GCCAAGCAGT TCGCCCGCTA TGTCGAGTCC
GAAGACACCC GCCTCGTCAC GGTGGGGAAG GAGCGTTACC GCGTGTATCG CTATGAGGGG
GCGATCCATG GCCTCGATGA CGCGGTGGTG CTGCTGGCTT GGAAGGCGGA TCAGCCGATG
GCGCCGGAAC ATCTCCATGT CGTCTTGAGC ACCGATCGGG AGCTGAGCGA CGAAGACATC
TTGCGTTACT ATGCTCAGCG TTGGACGATC GAGTGCTTTT TCCGGCAGGC GAAAGATCAA
CTGAAGCTTG ATGGATACCG CGTTCGCCAC ATTCGGGCGG TGAAACGGTA TTGGGCGGTG
GTGCTGTTGG CCTGCGTGTA TAGCATCGCC GAATCCCGAC AAAACCTCTC CGCCGGGCTG
GAGCTTCTTC GGTCGCGGAA AGACCACAGC GTCGTCGAGT TCATTTATGA CGCTGCGAAG
CAAGATATTC CCATTGATGT GATCAAAAAA CAGCTCCGTA TCGCGTAA
 
Protein sequence
MNRLAHHQGI HKFFFTLGLT LQLSKPVIKH LIHIVDALTT KGFSGTLTDI HHWSFHPNHR 
TTLSHFFTKS PWDEERLLGK LQEWILRRIE RLVERKNQPL FVSIDDTICQ KTKPSSRATH
AIQGCDWHYS HKDHQSVWGH SLVWLMVHTL TQAFPFAFRL YDKKAGKSKI DLAIEMLSSL
KVKRAQPVYV LMDSWYPSKK LIEACLKQGF HVIAMLKTNR ILYPKGIAIQ AKQFARYVES
EDTRLVTVGK ERYRVYRYEG AIHGLDDAVV LLAWKADQPM APEHLHVVLS TDRELSDEDI
LRYYAQRWTI ECFFRQAKDQ LKLDGYRVRH IRAVKRYWAV VLLACVYSIA ESRQNLSAGL
ELLRSRKDHS VVEFIYDAAK QDIPIDVIKK QLRIA
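
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this with a hand-built codon table, assuming the amino acid assignments of table 11 (which match the standard code; the two differ only in permitted start codons). The `translate` helper is hypothetical, ignores alternative start codons, and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (first base slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAATAGAT TAGCACATCA CCAAGGAATC"))  # MNRLAHHQGI
```

Passing the full gene sequence (with or without the spaces between 10-base blocks) should reproduce the protein sequence listed above.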