Gene Snas_5220 details


Gene Information

Locus tag: Snas_5220
Symbol:
ID: 8886429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 5546315
End bp: 5548357
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_003513947
Protein GI: 291302669
COG category:
COG ID:
TIGRFAM ID:
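
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 5546315
END_BP = 5548357


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2043 680
```

Under those assumptions the result agrees with the listed Gene Length (2043 bp) and Protein Length (680 aa).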


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.46382
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.858688
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCACG AGACCCGGGT GGCCATCGAC GTCGGCGGGA CGTTCACCGA CGTCGTCACG 
CTTCGGCCCG ACACCGGCGA GTTCCGGTTC GAGAAGGTCC CCACCACCCC CGAGGCCCCC
GCGCGCGGCG TCCTGGACGC CTTCGGCGCG GCCGAAGTGG ACATGCCGGA CGTGTCGATG
TTCAACCACG GCACCACCCT GGGACTGAAC TCGCTGCTGA CCCGCACCGG CGCCAAGGTG
GCCGTGGTCG GCACCCGGGG CTTTCGCGAC GTGTACCTGC TGGGCCGCAC CAACCGCGAC
GTCATGTACG ACATCGCCTA CCGCAAACCC GAACCGCTGC TGGAACGCTA CGACACCTTC
GAGGTCGCCG AACGGTCCTA TTTCGACGGC ACCGTCGCGA CCCCGCTGGA CGAGGCCGAC
GCGGCCCGCG TCGCCGCCCA GATCGGCGAG CGCGACTACC AGGCCGTCGC GGTGGCATTC
CTGCACTCCT ACGCCAATCC GGCGCACGAG ACCCGGATGC GCGAGATCCT GCTGGAACAC
TGTCCCGACG TCGAGGTGAC CGTCTCGCAC GAACTGTCCC GCGAGTACCG GGAGTACGAG
CGCACCTCCA CCGCGGTGCT GGACGCCTAC ATCAAGCCGA TCGTCCGGCG CTACCTCGCC
GAACTCGACG ATGGACTCAC CGACGCCGGA TTCGGCGGCC GGTTCCTGAT GTCGCGGTCC
GGTGGTGGCG CCATGACCGC CGAGGCCGCC CGGGAACAAC CGGTCAACCT GATCCTGTCG
GGCCCCGCCG GCGGCGTGGT CGGCGCGGCC GGGTTCGCGA AACTGTTGGG GCGCCCCAAT
CTCATCACCA TCGACATGGG AGGAACCAGC CTGGACGCGT CGCTGGTTCT GGACTCCACC
CCGGTCGCGC ACCAGGGCGC CGAGTTCGAG GGGATGCCCA TCAACACGCC CTCGCTGTAC
ATCCACACCA TCGGCTCGGG CGGCGGCTCC CTTGTGTACC TCGACGACGC CGGGGCGTTG
CAGGTCGGCC CGAAGAGCGC CGGGGCGGTA CCGGGTCCGG TGGCCTACGG TCGCGGCGGC
ACCCGGCCCA CCTTCACCGA CGCGGCGCTG GCCGTCGGTT ACCTCGGCGC CGACACGCCG
CTGGGCGGCA CACTCGCTCT CGACGCCGAC GGTGCCCGGG AAGCGTTGCG GCCCATCGCG
AACCAACTGA ACTACTCCAC CGAGGAACTC GCGCGCGGCG TCCTGCGCAT CACGAACACG
AAGATCATGG GCGCGGTACG GGCGATCACC GTGGAACTCG GCCACGACCC CAAGGACTTC
GCGCTGCTGT CCTTCGGCGG CGCCGGGGGA CTGGTCGCCG TCGACGTGGC CCGCGAACTG
GGCATCCCCG AGGTGGTCGT GCCGCCGGGA CAGGGCGCCT TCTCGGCGCT GGGCATGCTC
ATGGCCGACG TCCAGCACGA CCTGTCCCGC ACCGCCGTCA CCGCCCTGGC CGATGTGGAC
CTCGACGGGA TGGGCGCCGC CTACGCCGAC CTGGAGGCCG AGGCCGCCGT CCAGCTGGAA
CACGAGGGCT TCGCCCCCGA AGCCCGGCGC TACGAACGCA GCGTCGACGT GCGCTACAGC
GGCCAGGAAC ACTCGGTCAG CGTCGCGTTC CCCTCCGCTG TGGACGACAC GATCGCCGTG
ATCGAGGCCG AGTTCGCCGA AGCCCACCGA CGCCAGTACG GCCACGTCAT GGACGACCCG
GTCGAGATCA CGACACTGCG GCTGCGCGCC ACCGGCGTCG TCGACAAACC CGAACTCCCG
TTGGCGCCCA AACGAACCGG CGAACCACTG CGACCGCGCG GCAGTCGGGT GGTGCACGAG
ACCGACGGCT CCACCGCCGA CTACGCGCGC TACGCCCGCG AGGACTTCGC CGCCGGAGAC
GCCTTCACCG GACCGGCCGT GGTCACCGAG CACACCGCCA CGACGGTGCT GCACGACGGC
GACCGGCTCG ACGTCGGGCC GCACGGCGAA CTCGTCATCA CACTCGGAAG GGAAACGGCA
TGA
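
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation such as GC content. The following is a minimal sketch of one way to do that; the helper names are hypothetical, and only the first row of the sequence is used as input here. Pasting the full sequence in its place would give the whole-gene value to compare against the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied verbatim from the record.
first_row = "ATGCAGCACG AGACCCGGGT GGCCATCGAC GTCGGCGGGA CGTTCACCGA CGTCGTCACG"

seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```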
 
Protein sequence
MQHETRVAID VGGTFTDVVT LRPDTGEFRF EKVPTTPEAP ARGVLDAFGA AEVDMPDVSM 
FNHGTTLGLN SLLTRTGAKV AVVGTRGFRD VYLLGRTNRD VMYDIAYRKP EPLLERYDTF
EVAERSYFDG TVATPLDEAD AARVAAQIGE RDYQAVAVAF LHSYANPAHE TRMREILLEH
CPDVEVTVSH ELSREYREYE RTSTAVLDAY IKPIVRRYLA ELDDGLTDAG FGGRFLMSRS
GGGAMTAEAA REQPVNLILS GPAGGVVGAA GFAKLLGRPN LITIDMGGTS LDASLVLDST
PVAHQGAEFE GMPINTPSLY IHTIGSGGGS LVYLDDAGAL QVGPKSAGAV PGPVAYGRGG
TRPTFTDAAL AVGYLGADTP LGGTLALDAD GAREALRPIA NQLNYSTEEL ARGVLRITNT
KIMGAVRAIT VELGHDPKDF ALLSFGGAGG LVAVDVAREL GIPEVVVPPG QGAFSALGML
MADVQHDLSR TAVTALADVD LDGMGAAYAD LEAEAAVQLE HEGFAPEARR YERSVDVRYS
GQEHSVSVAF PSAVDDTIAV IEAEFAEAHR RQYGHVMDDP VEITTLRLRA TGVVDKPELP
LAPKRTGEPL RPRGSRVVHE TDGSTADYAR YAREDFAAGD AFTGPAVVTE HTATTVLHDG
DRLDVGPHGE LVITLGRETA