Gene GYMC61_2022 details


Gene Information

Locus tag: GYMC61_2022
Symbol: flhA
ID: 8525886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2030815
End bp: 2032860
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: flagellar biosynthesis protein FlhA
Protein accession: YP_003253120
Protein GI: 261419438
COG category:
COG ID:
TIGRFAM ID:
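
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2030815
end_bp = 2032860
gene_length_bp = 2046
protein_length_aa = 681


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks hold for this record: the span covers 2046 positions, and 2046 / 3 - 1 gives 681 residues.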


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGCGC AAATCAAAGA TTTATCCGTA TTGTTTCTTG TCGTGCTCAT CGTCGCCATG 
CTCGTCATCC CGCTGCAGAC ATGGCTGTTG AGCGTCTTGA TCATCATCAA TATTTCCCTT
GCCTTGCTTG TATTGCTGAC GGCGATGAAC ACGAAAGAGC CGCTTGAGTT TTCCATTTTT
CCGTCATTGC TTTTAGTGCT GACGCTGTTT CGGCTCGGGC TGAATGTGTC AACCACCCGC
TCGATTTTAA GCAAAGGGGA GGCGGGCGGC GTCGTTGAAA CGTTCGGAAC GTTCGTCGTT
GGCGGCGATG TCGTCGTTGG GTTTGTCGTG TTTTTGATTT TGGTCATCAT CCAGTTTGTT
GTCATCACCA AAGGGGCGGA GCGCGTCTCG GAAGTGGCGG CCCGCTTTAC GCTCGATGCG
ATGCCCGGCA AACAGATGAG CATTGACGCC GATTTGAACG CCGGCATGAT TTCGGAGCAA
GAAGCCCGAA AGCGGCGGGA AAAAGTGGCG CAAGAAGCCG ACTTTTACGG AGCGATGGAC
GGGGCAAGCA AATTTGTGAA AGGCGACGCC ATCGCCGGCA TCATTATCGT CGTCATCAAC
ATGTTGTTTG GCATGGTGAT CGGCGTTGTG GAGAAAGGAA TGGATATAGG GGAGGCGGCA
AAGCGCTACA CGCTTTTGAC GGTCGGCGAC GGCATCGTCA GCCAAATTCC GGCGCTGTTG
ATTTCCACGG CGACCGGCAT CATCGTCACC CGGGCGGCGT CGGACAGCAA CTTGAGCGGC
GATATTATGC GCCAGCTGTT CGCGTTTCCG AAAATGTTGT ACGTCACGGC AGGAACGATT
TTCCTGCTCG GCTTGTTTAC GCCGATCAAC GACTTGTTGA CGATGCCGAT TGCCGGGCTG
CTGGCGCTTG GCGGCTACCG ATTTATCGAG CGGCAAAAGC AAGAAGAAGC GGCATTGGCC
GCTCCGGAAG AGGAGGCGGC GGCCGCCGAC GAATTGAAAA GCCCGGAAAG CGTCATCCAA
CTGCTGCATA TTGATCCGAT TGAATTTGAG TTCGGCTATG CGCTCATTCC GCTCGCTGAT
GCCAATCAAG GCGGGGATTT GCTTGACCGG ATCGTCATGA TTCGCCGTCA GCTCGCGCTC
GAGCTTGGCA TTGTCATCCC GGTTGTGCGC ATTCGCGACA ACATCCAGTT GCAGCCGAAT
GAATACCGGA TCAAAGTGAA AGGCGAGGAA GTGGCGCGCG GCGAGCTGCT GCTTGACCAT
TATTTGGCGA TGAGCCCCGG AGTTGACGAT GATTCGATTG ATGGCATTGA CACCATCGAG
CCGGCGTTTG GTCTGCCGGC GAAATGGATT TCCGAGACCG TAAAAGACCG GGCGGAAATG
CTTGGCTATA CAGTCGTCGA CCCGCCATCG GTCGTCTCGA CACACTTAGC GGAGGTGCTG
AAAGCCCATG CCCACGAACT GCTGGGGCGT CAGGAAACAA AACAGCTGAT CGATCATTTA
AAAGAATCGT ATCCGGTGCT TGTCGACGAT GTGACGCCGA ATCCGTTGTC TGTCGGCGAC
GTGCAAAAAG TGCTCGCCAA GCTGCTGAAA GAGAAAGTGT CGATCCGCAA CTTGCCGCTC
ATTTTTGAGG CGCTTGCCGA TTTTGCCCGC CTGACCAGCG ACACCGATTT GTTGACGGAA
TACGTCCGCC AAGCGTTGGC GCGGCAAATC ACCTCGCAGT ACGCCGTGCC GGGCGAGCCG
CTGCGCGTCA TTACGCTGTC GGGCAGGGCG GAAAAAACGA TCGCCGACGC CGTGCAGCAA
ACCGAACACG GCCGCTATTT GGCGCTCGAG CCGGCGCGGG CGCAGGCGTT TGTGGAAGCG
GTCGCCGCGG CGCTTGAACG CTATCCGTTC GCCGGCCAGA CGCCGATTTT GCTCTGCTCC
CCGGCGGTGC GCATGTACGT CCGCCAGCTG ACCGAGCGCC ATTTTCCGAC TGTTCCGGTG
CTGTCGTACA ACGAGCTTGA AGCGGATGTC GAAGTTCAAA GCGTGGGGAT GGTGGAAATC
GAATGA
 
Protein sequence
MQAQIKDLSV LFLVVLIVAM LVIPLQTWLL SVLIIINISL ALLVLLTAMN TKEPLEFSIF 
PSLLLVLTLF RLGLNVSTTR SILSKGEAGG VVETFGTFVV GGDVVVGFVV FLILVIIQFV
VITKGAERVS EVAARFTLDA MPGKQMSIDA DLNAGMISEQ EARKRREKVA QEADFYGAMD
GASKFVKGDA IAGIIIVVIN MLFGMVIGVV EKGMDIGEAA KRYTLLTVGD GIVSQIPALL
ISTATGIIVT RAASDSNLSG DIMRQLFAFP KMLYVTAGTI FLLGLFTPIN DLLTMPIAGL
LALGGYRFIE RQKQEEAALA APEEEAAAAD ELKSPESVIQ LLHIDPIEFE FGYALIPLAD
ANQGGDLLDR IVMIRRQLAL ELGIVIPVVR IRDNIQLQPN EYRIKVKGEE VARGELLLDH
YLAMSPGVDD DSIDGIDTIE PAFGLPAKWI SETVKDRAEM LGYTVVDPPS VVSTHLAEVL
KAHAHELLGR QETKQLIDHL KESYPVLVDD VTPNPLSVGD VQKVLAKLLK EKVSIRNLPL
IFEALADFAR LTSDTDLLTE YVRQALARQI TSQYAVPGEP LRVITLSGRA EKTIADAVQQ
TEHGRYLALE PARAQAFVEA VAAALERYPF AGQTPILLCS PAVRMYVRQL TERHFPTVPV
LSYNELEADV EVQSVGMVEI E
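
Both sequences are printed in blocks of 10 characters, 60 per line, so the spacing has to be stripped before any analysis. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied as displayed in this record.
first_line = "ATGCAAGCGC AAATCAAAGA TTTATCCGTA TTGTTTCTTG TCGTGCTCAT CGTCGCCATG"
dna = clean_sequence(first_line)
print(len(dna), round(gc_fraction(dna), 2))
```

Passing the full gene sequence text to `clean_sequence` should yield a string whose length matches the Gene Length field, which is a quick way to confirm that nothing was lost in copying.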