Gene Nmag_3957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3957 
Symbol 
ID8826827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp365616 
End bp367640 
Gene Length2025 bp 
Protein Length674 aa 
Translation table11 
GC content59% 
IMG OID 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_003482058 
Protein GI289583648 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCGTG CTGGTGTAGA CATTGGCGGA ACGTTCACTG ACGTGATCGT CTTCGATGAG 
AAAACCGGTG AGATCGAGAT CGAGAAGACG CCCTCAACGC CGGACAACCC GGCAGAAGGG
GTAATAAACG GGTTGACCAA GGCAGATACT GAGATCGCTG ACCTCGAGTT CTTCTCGCAC
GGATCGACGG TCGGGACGAA CGCGTTGATC GAACGCGAGT TGCCGCGAAT TGGACTAATT
ACGACAGACG GGTTCCGAGA CGTTCACGAA ATCCGTGATG CGACGAAAGA AGACCTCTGG
GACGCGTATC AGGATGTTGC GGATCCATAC GTTCAGCGGC GGGATCGACT CGAGGTCCCC
GAGCGAATCG ATTATGCAGG AAACGTCGTC GAACCGCTTG ACGAGGATGC CGTCCGCGAT
GTCGTTCGAA TCTTCGAGAA ACGCGGAATC GATACGATCG CAATTTCGTT GATTAACGCG
TACGTCAACG GTGAACACGA GAAACGAGTC GAGGAAATCG TCGCGGAAGA GTATCCCGAG
GCGTTCGTCT GTAGTTCCCA CGAGATCCTT CCGGAGATGT TCGAACACGA GCGGACGAGT
ACGACGGTCA TCAACGCGGC GCTGGTGCCG GTCGTTCGTG ACTATCTGAC CGATCTTGCA
GATCAACTCG CAGACCGGGG CTACGACGGC GACGTGCTCG CGATGCACTC CGGTGGTGGG
GTGATGACGA CCGAGGCGAT TGCCTACTAC GCGGCTCGTA TCGCAAACTC GGGGCCGACG
GCAGGCGCAA TCGCTGGCCG GTACATCGCC CAGCAGTGTG GCTTCGAGAA CGCGATCGGG
TTCGACATGG GTGGGACGAG TGCGGATGTC TCGGTCACGC ACGAGGGGGA GGTAGAGACG
ACCGACGAGT GGGCTGTTGA GTACGGCTAT CCGATCATGT TCCCCTCGAC GGATATCGAG
ACGATCGGGG CTGGCGGGGG GTCCATTGCC TGGATCGATG ACGGCGGCTC ACTCCGTGTT
GGTCCGAAGA GCCAGGGGGC GGACCCGGGA CCAGCCTGCT ATCTTCGCGG CGGCGACAAG
CCGACGACGA CGGACGCGAA CGTCGTTCTC GGCTGGGTCG ACCCGGATCA GTTCCTCGGT
GGTGACATGG ACGCGACTGC CGAACCCGCT CGTGAGGTCA TTCAGCGCGA TATCGCCGAG
CCACTCGACC TCGAACTCAC TGAGGCGGCC TCGGCAATCG AGCAGATCGC TGTCGCCAAT
ATGTGCAACG CCGTCCGTCT CGTTTCGACG AGCAAGGGCT ACGATCCTCG TGACTTCGCG
CTTGTCGCGT TTGGCGGTGC AGGTCCACTG CACGCAGCCC ACGTCGCGCG TGAGATGAAT
ATCCCTAACG TGATCATCCC ACCGTATCCG GGGATCAACT CGGCACTCGG CTGTCTGTTG
GTCGACGTTG AACACGACCT CTCACAGACG TTCATCGCTG ACACCTCAAA CGATGTTGTC
GACGACATCG AGTCGGCGTT CGCGGAGATG GAAGATGAAA TCCACGAACG ACTGGACGAG
GAGGGTGTCG ACGAGACCGA CATTCAACTC GATCACGAAA TCAAGATGCG CTACACTGGC
CAGTGGCGCT CACTTGAGGT CAGTTGCTCA CGTCCGATCG AGAGTATGGC AGAGATCAGG
TCCCAGTTCC ACAGCCAGCA CGAACAGACC TATGCTTACT CCGATACAGA TCAGCCCGTC
GAGATCTACG GGCTTCACGT CACTGGTCGC GGTGTTGTCG AGAAACCAGC CTTCCCCGAA
ATTGAGGACG GGAACGCCGA CGCCGCTCGA CGGACGACGC GCGAAGCGTA CTTCGACTCG
GAAGGCGAGT TCGTCGAGAC GGCTGTCTAC GACCGGGCCG AACTCGGGGC CGGCGCAACG
CTCGAGGGAC CGGCGATTAT CGAACAGATG GATTCGACCG TGGTCGTCCC ACCGAACGTG
ACGGCCGAAG TCGAGCAAAC GGGGAATATC ATTCTCACGG TGTAA
 
Protein sequence
MRRAGVDIGG TFTDVIVFDE KTGEIEIEKT PSTPDNPAEG VINGLTKADT EIADLEFFSH 
GSTVGTNALI ERELPRIGLI TTDGFRDVHE IRDATKEDLW DAYQDVADPY VQRRDRLEVP
ERIDYAGNVV EPLDEDAVRD VVRIFEKRGI DTIAISLINA YVNGEHEKRV EEIVAEEYPE
AFVCSSHEIL PEMFEHERTS TTVINAALVP VVRDYLTDLA DQLADRGYDG DVLAMHSGGG
VMTTEAIAYY AARIANSGPT AGAIAGRYIA QQCGFENAIG FDMGGTSADV SVTHEGEVET
TDEWAVEYGY PIMFPSTDIE TIGAGGGSIA WIDDGGSLRV GPKSQGADPG PACYLRGGDK
PTTTDANVVL GWVDPDQFLG GDMDATAEPA REVIQRDIAE PLDLELTEAA SAIEQIAVAN
MCNAVRLVST SKGYDPRDFA LVAFGGAGPL HAAHVAREMN IPNVIIPPYP GINSALGCLL
VDVEHDLSQT FIADTSNDVV DDIESAFAEM EDEIHERLDE EGVDETDIQL DHEIKMRYTG
QWRSLEVSCS RPIESMAEIR SQFHSQHEQT YAYSDTDQPV EIYGLHVTGR GVVEKPAFPE
IEDGNADAAR RTTREAYFDS EGEFVETAVY DRAELGAGAT LEGPAIIEQM DSTVVVPPNV
TAEVEQTGNI ILTV