Gene MCA1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1784 
SymbolpurB 
ID3102876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1914052 
End bp1915413 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content63% 
IMG OID637170944 
Productadenylosuccinate lyase 
Protein accessionYP_114222 
Protein GI53804158 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID[TIGR00928] adenylosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.47541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTCCA TCAAGGCATT GTCGCCAGTA GACGGCCGTT ACGCCGGCAA GGCCGACGCC 
CTCCGCAACA CCTTCAGTGA ATACGGCCTC ATCCGGTTCC GGATCCTGGT CGAATTGCGC
TGGCTGGAGG CGCTGGCGGC CGAGCCCACG ATCACCGAGG TTCCCCCTTT GAGTGGCGAA
GCGCGCGATC GTCTGAACCG GATCGTGGAC GAATTCGGCG AAGATCATGC CGAACGGGTC
AAGACCATCG AGCGGACCAC CAACCACGAC GTCAAGGCGG TCGAATATTT CCTGAAGGAG
AGGATCCAGG GCTGCCCCGA ACTGGAGCGG ATCGCCGAGT TCATCCATTT CGCCTGCACC
TCTGAAGACA TCAACAATCT TGCCTATGGG CTGATGGTGA AGGAGGCGCG TGACACGGTC
CTGCTGCCGG CGATGGACGA GTTGATCGAG GCGGTGCGGG AGCGGGCGCA TGTCTATGCC
GGCCAGCCGA TGCTTTCGCG CACTCACGGC CAGCCGGCGA CGCCCACCAC GGTCGGGAAG
GAGTTCGCCA ACTTCGCCGC CCGGCTGGCT CGCCAACGCG AGCAAGTAGC GGCGGTGGCC
CTGATGGGCA AGATCAACGG TGCGGTCGGC AATTTCAACG CCCATGCGGT CGCCTACCCC
GAAGTCGATT GGCCCAAACT GGCACAAGGT TTCGTCGAGT CGCTGGGGCT GGCCTGGAAC
CCTTATACCA TACAGATCGA GCCACACGAT TATCTGGCCG AGCTGTGCCA TGCCTACAGC
CGTTTCGGCA CCGTGCTGAT CGACTTCGAC CGCGACGTCT GGGGCTACAT CTCGCTGGGT
TTTTTCCGGC AGAAGACCGT GGCCGGCGAA GTCGGCTCTT CCACCATGCC GCACAAGGTC
AACCCGATCG ATTTCGAGAA CTCGGAAGGC AACCTCGGTA TCGCCAACGC GCTGTTCTCG
CATTTCGCCG AAAAGCTGCC GATTTCCCGC TGGCAGCGCG ATCTAACCGA CTCCACGGTG
CTGCGCAATT TCGGCGTCGG CCTCGCCCAT CTGCTCATCG CGCTCGGTTC CACTCTGAAA
GGCCTGGGCA AGCTGGAGCT GAGCCCTCCG GTCCTGGAGG CCGATCTCGA CGGCAACTGG
GAAGTGCTGG CCGAGGCGAT CCAGACCGTG ATGCGCCGCT ATGGCGTGGA ACGGCCCTAC
GAGAAGCTCA AGGCCTTGAC CCGCGGCCAG CGGGTGGACG CGGAGGGCCT GCGTGCCTTC
GTAGAGACCC TGGAGATACC CGAGGAGGCG CGCAGCCGCC TGGCAGCGCT GGCTCCCCGC
GATTACATCG GCTACGCTGA AACCTTCGCC AAAACCATCT GA
 
Protein sequence
MNSIKALSPV DGRYAGKADA LRNTFSEYGL IRFRILVELR WLEALAAEPT ITEVPPLSGE 
ARDRLNRIVD EFGEDHAERV KTIERTTNHD VKAVEYFLKE RIQGCPELER IAEFIHFACT
SEDINNLAYG LMVKEARDTV LLPAMDELIE AVRERAHVYA GQPMLSRTHG QPATPTTVGK
EFANFAARLA RQREQVAAVA LMGKINGAVG NFNAHAVAYP EVDWPKLAQG FVESLGLAWN
PYTIQIEPHD YLAELCHAYS RFGTVLIDFD RDVWGYISLG FFRQKTVAGE VGSSTMPHKV
NPIDFENSEG NLGIANALFS HFAEKLPISR WQRDLTDSTV LRNFGVGLAH LLIALGSTLK
GLGKLELSPP VLEADLDGNW EVLAEAIQTV MRRYGVERPY EKLKALTRGQ RVDAEGLRAF
VETLEIPEEA RSRLAALAPR DYIGYAETFA KTI