Gene GYMC61_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_3601 
Symbol 
ID8527488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013412 
Strand
Start bp41213 
End bp43261 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content42% 
IMG OID 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_003254627 
Protein GI261420946 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAGGG TAGCGGTCGA CGTGGGAGGT ACCTTTACTG ATGTTGTTTT ACAAAACGAA 
GAGACAGGTG AGATTTTTGT TACAAAGGTT CCTTCTACAC CATCTGACCA ATCAATCGGA
TTAATGGATG GAATACTTAA AATCTGTAGA GAAGCAGGTG TTAGTTTATC TGATATTAGG
ACGATTATAC ATGGTACAAC AGTGGCTACT AATGCAGTTT TAGAAGGAAA AGGTGCAAAG
GTTGGATTAA TAACAACTCA TGGATTTGAA CAAATTCTAC ATGTTGCCCG TTCTTGGACT
CCTGCTCCTG TTAGCGCATG GATTGGATTT ATTAAGCCTG ATCCACTTGC AGATCTGACT
AATACCCGTG GAGCATTAGA ACGGATTAGT GCACAGGGTG AAATAATACG TGAACTCGAT
GAAGATCATA TTCGGCGTCA AATTCAGGAA TTGTATGAAA AAAGTGTTGA AAGCTTAACG
ATTAGTTTAA TTAATTCCTA TGCGAACCCA GTACATGAAC AACGAATTAG GGAAATTGCG
ACAGAAATAA ACCCAGATAT TCCCGTTTCT ATATCTTATG AAATCTTACC GGAGTTTAGG
GAATATGAAA GAACGCTAAC AACCGTTATG AATTCTTATG TTCGACCGCC AATGCAAAAA
TATTTGAGAA ACATTGAAAA TAAGTTAAAG GAAAATCATA TGCGTAGCCG TGTTGGTATT
GTACGTTCAG ATGGAGGACT AATGAGCATT TCGGCAGCTG CTACGCGTCC CGTACACACA
ATGTTATCCG GACCTTCCGG TGGAGTAACT GCATCAGCTA TGATCGGTAT ACAGGCTGGA
TTTAGAAATG TAATATCTTT CGATATGGGG GGGACATCAA CGGATGTAGC GTTGACATAT
GACGGTAAAC CTAGGGTTTC AAGAGAAACC AAGGTTGGAA CCTTTCCAGT AAAAGCCCCT
TCACTCGAAG TTGTCAGCAT CGGGGCTGGG GGGGGATCGA TTGCGCATGT TCCCCCAACT
GGCGCGTTGC GCGTTGGACC AAAAAGTGCC GGAGCTGATC CCGGGCCCGC TTGTTATGGT
CGTGGTGGCG AAGAGCCAAC GGTAACGGAT GCAAATGTAG TTTTAGGATA CCTCCCTTCA
AGTTTAGTTG GTGGAGAAAT GAAACTTGAT GTGGAGGCAG CATTTAAGGC AGTTGGTAAG
ATTGCAGAAC GCCTAGGTAT AGATGTGTAT CGTGCAGCCA AGGGTATATA TGACATTGTT
AATGAGAATA TGTACGGTGC AATTCGAGTT GTATCAGTTG AAAAAGGTTA TGATCCACGT
GATTTCGCAT TGATTGCACT GGGAGGGGCA GGTCCTTTAC ATGCGAATGC GTTAGGGCGT
TTATCTGGTT CATTCCCTGT TATTATTCCA CCAACTCCTG GAGTCTTATC TGCATTAGGT
TTCTTACAAT CGGATATTCG TAATGAATAC TCCAAGACCT TTATTCGTAC TTTAAGTCAA
ATCGATGTGC GCTCACTAAT AAGAGAGTTA AACGAACTAG GAAAAGAAGC TGAAGAATGG
CTGATTCAAG AGTCAGTACC CAAGGATCAA CAGACAGTTT CCTTCGAAGT AGATGTACGT
TACTTTCGAC AAGGTTATGA AATTTCAATT CAGGTTGATA AACAGACCTT AATACAAAAT
GGTCTTGGAC TGCTGAAGAG TCAATTTGAT AAAATTCACG AGAAAATCTA CGGCTTTAAG
ATGGATATTG AACTTGAGAT TGTTAACCTG AGAGCAGTTG CTATAGGAAA AGTTACTTCT
CCATCGTTGC CTTCCAGTAG CCCTGGTAAT GAGGATGCGT CGCACGCCTT AATAGATAAG
GAACACAAGG CATTTTTTGA TGGCGAATTT TTACCTACAC CGTTATATGA TCGTGCGCTG
TTAAAACCCA ATAATAGAAT TCCGGGACCA GCGATCGTTA TCCAAAAAGA TAGTACAACA
TTAGTTTTGC CGGGTTATGT GGCGGTAGTT GATAACCATA TGAACCTTTT GATTAAAGAG
GAGGTTTAA
 
Protein sequence
MYRVAVDVGG TFTDVVLQNE ETGEIFVTKV PSTPSDQSIG LMDGILKICR EAGVSLSDIR 
TIIHGTTVAT NAVLEGKGAK VGLITTHGFE QILHVARSWT PAPVSAWIGF IKPDPLADLT
NTRGALERIS AQGEIIRELD EDHIRRQIQE LYEKSVESLT ISLINSYANP VHEQRIREIA
TEINPDIPVS ISYEILPEFR EYERTLTTVM NSYVRPPMQK YLRNIENKLK ENHMRSRVGI
VRSDGGLMSI SAAATRPVHT MLSGPSGGVT ASAMIGIQAG FRNVISFDMG GTSTDVALTY
DGKPRVSRET KVGTFPVKAP SLEVVSIGAG GGSIAHVPPT GALRVGPKSA GADPGPACYG
RGGEEPTVTD ANVVLGYLPS SLVGGEMKLD VEAAFKAVGK IAERLGIDVY RAAKGIYDIV
NENMYGAIRV VSVEKGYDPR DFALIALGGA GPLHANALGR LSGSFPVIIP PTPGVLSALG
FLQSDIRNEY SKTFIRTLSQ IDVRSLIREL NELGKEAEEW LIQESVPKDQ QTVSFEVDVR
YFRQGYEISI QVDKQTLIQN GLGLLKSQFD KIHEKIYGFK MDIELEIVNL RAVAIGKVTS
PSLPSSSPGN EDASHALIDK EHKAFFDGEF LPTPLYDRAL LKPNNRIPGP AIVIQKDSTT
LVLPGYVAVV DNHMNLLIKE EV