Gene GYMC61_1926 details

Gene Information

Locus tag: GYMC61_1926
Symbol:
ID: 8525790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1946626
End bp: 1947813
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: transposase IS4 family protein
Protein accession: YP_003253030
Protein GI: 261419348
COG category:
COG ID:
TIGRFAM ID:
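
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only a consistency check, not part of the record. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1946626
end_bp = 1947813
gene_length_bp = 1188
protein_length_aa = 395

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the listed gene length
print(span_bp % 3 == 0)                          # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions all three checks print `True`: the span is 1188 bp, which is 396 codons, or 395 amino acids plus one stop codon.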


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAGAT TAGCACATCA TCAAGGAATC CACAAGTTTT TCTTCGCATT GGGGTTGGCA 
TTGCATTTCT CGAAGCCGGT TCTGAAGCAT CTCGTTCATA TCGTCGATGC CTTGACCACC
AAGGGATTCT CGGGAACGCT GACGGATCTC CATCACTGGA GTTTTCATCC CAATCATCGA
ACAACGCTCA GCCATTTTTT CACGAAAAGC CCTTGGGATG AAGAGACGTT GCTTCGCAAA
CTCCAGCAAT GGATCCTTCA TCGCATCAAG CGAATCGCCA AACGGGAGAA TCAACCCCTT
TTTGTTTCGA TTGATGATGC CATTTGCCAA AAAACGAAGC CTTCGTCACG GGCTGCACAC
GCCATTCAAG GGTGTGATTG GCATTTCTCT CACAAAGATC ATCAATCGGT CTGGGGGCAT
TCGCTCGTTT GGCTGATGGT GCACACCATG ACTCAGGCAT TCCCGTTCGC GTTTCGTCTT
TATGATAAGA CGGCGGGAAC AAGCAAAGTC GACCTAGCGA TGGAGATGCT TTCTTCGCTC
AAAGTGAAGC GGGCTCAGCC GGTGTATGTG CTCATGGATT CGTGGTATCC GTCCAAAAAG
CTCATTGAAG CCTGCTTGAA ACAGGGATTC CATGTCATCG CCATGCTCAA GACGAACCGG
ATTCTCTACC CGAAAGGCAT CGCCATCCAA GCCAAGCAGT TCGCCCGCTA TGTCGAGTCC
GAAGACACCC GCCTCGTCAC GGTGGGTCAG GAGCGTTATC GCGTGTATCG CTATGAGGGG
GCGATCCATG GCCTCGATGA CGCGGTGGTG CTGCTGGCTT GGAAGGCGGA TCAGCCGATG
GCGCCGGAAC ATCTCCATGT CGTCTTGAGC ACCGATCGGG AGCTCGGGGA CGAAGACATC
TTGCGTTACT ATGCTCAGCG TTGGACGATC GAGTGCTTTT TCCGGCAGGC GAAAGATCAA
CTGAAGCTGG ATGGATACCG CGTTCGCCAC ATTCGGGCGG TGAAACGGTA TTGGGCGGTG
GTGCTCTCCT CCTGCGTGTA CAGCATAGCG GAATCCCAGC AAGACCTCTC CACCGGATTG
GAGCTTCTTC GATCGCGGAA AGGCCACAGC GTCGTCGAGT TCATTTATAA CGCCGCGAAT
CAAGAGATTC CCATTGATGT GATCAAAAAA CAGCTCCATG TCGCCTAA
 
Protein sequence
MNRLAHHQGI HKFFFALGLA LHFSKPVLKH LVHIVDALTT KGFSGTLTDL HHWSFHPNHR 
TTLSHFFTKS PWDEETLLRK LQQWILHRIK RIAKRENQPL FVSIDDAICQ KTKPSSRAAH
AIQGCDWHFS HKDHQSVWGH SLVWLMVHTM TQAFPFAFRL YDKTAGTSKV DLAMEMLSSL
KVKRAQPVYV LMDSWYPSKK LIEACLKQGF HVIAMLKTNR ILYPKGIAIQ AKQFARYVES
EDTRLVTVGQ ERYRVYRYEG AIHGLDDAVV LLAWKADQPM APEHLHVVLS TDRELGDEDI
LRYYAQRWTI ECFFRQAKDQ LKLDGYRVRH IRAVKRYWAV VLSSCVYSIA ESQQDLSTGL
ELLRSRKGHS VVEFIYNAAN QEIPIDVIKK QLHVA