Gene GYMC61_1954 details


Gene Information

Locus tag: GYMC61_1954
Symbol:
ID: 8525818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1971102
End bp: 1972436
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: sun protein
Protein accession: YP_003253053
Protein GI: 261419371
COG category:
COG ID:
TIGRFAM ID:
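
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1971102
END_BP = 1972436
GENE_LENGTH_BP = 1335
PROTEIN_LENGTH_AA = 444


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1335
print(coding_codons(GENE_LENGTH_BP))   # 444
```

Under those assumptions both results agree with the listed Gene Length and Protein Length.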


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGTAC GTGAACTTGC TTTAGATACG CTGTTAGCCA TTGAACAAAA AGGAGCGTAC 
AGTCATTTGC AATTAAACGA AGCCATTCAA AAAGGACGCC TTGACGGGCG CGATGCGGCG
CTGTTGACCG AGATTGTCTA CGGCACGGTG CAACGGCGCG ACACGCTCGA TTATTATTTG
GCGCCGTTTT TGCGCAAGGC GCGCCGGCTT GAGCCGTGGG TGCGCGTGCT TCTTCGGTTG
ACGCTGTATC AAATGGTGTA TTTGGATCGT GTTCCCGACC GCGCCGCCGT CTTTGAAGCG
GTTGAAATCG CCAAACGGCG CGGCCATCGC GGCATTGCTT CACTTGTCAA CGGCGTATTG
CGGGCCATCG GCCGCGAGGG GCTGCCGTCG ATCGAGGCGG TTGATGACGC GGGCAAGCGG
CTCGCCCTTG CCACCAGCCA TCCCGAGTGG CTCGTTCGGC GCTGGATCCA GCAATATGGT
TATGAAGAAG CGGCGCGCAT GTGCGAGACG AATTTGCGCC CGCCGCAATC AACGGCCCGC
GTCAACCGGC TGCGTGCGAC TGTCGAAGAA GCGCTTGAGC GGCTGCGCGC TGAAGGGATG
CAGGTAATCC CTGGCCATGT CGCTCCGGAG GCCATCCGGG CGGAAAAAGG GAATTTGGCT
CATACGGAGA CGTTTCGCGC CGGTTGGCTG ACGATTCAAG ATGAAAGCTC AATGCTTGTC
GCTCGAGCGC TTGACCCAGC CCCCGGCGAG CGGGTGCTTG ACTGCTGCGC GGCGCCGGGC
GGGAAAACGA CCCATATCGC CGAGCGGATG GACGGGCGCG GCGAGGTCGT TGCTGTTGAT
ATCCACGAAC ATAAGGTAAT GCTGATCGAA CAGCAGGCCA AGCGGCTCGG GCTTGACAAC
GTTGCGACGC TCTCCCTTGA CAGCCGCCGG CTCGGCGAGC GGTTTGCCCC GGAATCGTTT
GACCGCGTTT TGGTCGATGC GCCGTGCACT GGATTCGGCG TCATTCGCCG CAAACCGGAG
ATCAAATATA CGAAAGGGAA AGACGCCATC GCCGCGCTTG TCGAGATTCA GCAAGCCATT
TTGCGGGCGG CCGCCCCGCT TTTGAAAAAA GGCGGTACGC TTGTCTACAG CACATGCACG
GTGGAGCGCG AGGAAAATGA AGAAGCCATC GCCCGCTTTT TAGCGGATCA TCCGGACTTT
TTCCTCGACG CCAGCCTTGC CGAGCGGATG CCGAAACCGG TGCGGCCGCA TGTGAAAGGC
GGCATGTTGC AGCTTTTGCC GCATCATTTT GACTCTGACG GGTTTTTTAT CGCCCGACTG
CGAAAGAGGG TGTAA
 
Protein sequence
MNVRELALDT LLAIEQKGAY SHLQLNEAIQ KGRLDGRDAA LLTEIVYGTV QRRDTLDYYL 
APFLRKARRL EPWVRVLLRL TLYQMVYLDR VPDRAAVFEA VEIAKRRGHR GIASLVNGVL
RAIGREGLPS IEAVDDAGKR LALATSHPEW LVRRWIQQYG YEEAARMCET NLRPPQSTAR
VNRLRATVEE ALERLRAEGM QVIPGHVAPE AIRAEKGNLA HTETFRAGWL TIQDESSMLV
ARALDPAPGE RVLDCCAAPG GKTTHIAERM DGRGEVVAVD IHEHKVMLIE QQAKRLGLDN
VATLSLDSRR LGERFAPESF DRVLVDAPCT GFGVIRRKPE IKYTKGKDAI AALVEIQQAI
LRAAAPLLKK GGTLVYSTCT VEREENEEAI ARFLADHPDF FLDASLAERM PKPVRPHVKG
GMLQLLPHHF DSDGFFIARL RKRV