Gene Mbar_A1438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A1438 
Symbol 
ID3626119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp1771190 
End bp1772287 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content49% 
IMG OID637700327 
Productchorismate synthase 
Protein accessionYP_304976 
Protein GI73668961 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGAA ACGTTTTTGG GCAGATGTTT CGAATTACTA CCTGGGGTGA ATCCCACGGC 
AAGGCTGTAG GTGTTGTAGT AGATGGATTG CCTGCAGGGC TTCCTTTCTC CGAGGCTGAT
ATTCAAAAAG AACTGGATAG GAGGCGCCCG GGGCAGAGCG AGGTTTCAAC TCCGCGGCAT
GAAGCTGACA GAGTGGAAAT TCTTTCGGGA ATTTTTGAAG GGATGAGTAC AGGCACTCCT
GTTTCTATGC TTGTCTGGAA CTCGGATGCC AGGTCTTCAG CTTATGATGT CATTAAAGAC
ACTCCCAGGC CCGGACATGC CGATTTTACT TACATGGCCC GCTACGGAAT GAGGGACCAC
CGTGGTGGAG GCAGGTCTTC AGCTCGGGAA ACAATAGGCA GGGTTGCAGG TGGAGCCCTT
GCAAAACTCC TGCTTTCCAG ATTCGGAATC CTAATTGCTG GACATGTGCT TGAACTCGGA
GCCCTCCGCG CAAAACCTCT TTCTTTTGAA GAAATTCTTG AAAATGTAGA AAAGACCCCT
GTACGCTGTG CCGATCTTGA GGCTGCTGAA AAAATGCTTG AGAAAGTTGC GGCCCTTCGG
CAGGAAGGAG ACAGTATCGG TGGCATTGTC GAGCTTATTA TAAGAGGTGT GCCTGCAGGC
CTTGGAGAAC CTGTCTTTGA CCGACTGGAT GCAGACCTTG CAAAAGCTCT GATGAGTATT
CCTGCCGTCA AAGGCTTTGA AATTGGGGCC GGATTTGAAG CTGCTCGCCT GTACGGTAGT
GAAATGAACG ATCCTTTCCG AATAAAGGAA GGAAAAATAA CCACTTCAAG CAATAACGCA
GGTGGAATTC TTGGAGGTAT TTCAACCGGA TTGGACATTG TCTGCAGGGC AGCAGTAAAG
CCAACTCCGT CCATAGGAAA AGTTCAGCAG ACAGTTGACC TCAAAACCCT GGAAAATACT
GAAATTGCAA TAAAAGGCCG GCATGATCCC ACAATTCCTC CGCGCATGGT TCCGGTTGCC
GAAGCTATGG TTGCCCTTGT GATTGCTGAT CATATGCTCA GGAGCGGGTT TATTAATCCG
AGAACTCTGC TGGAATGA
 
Protein sequence
MAGNVFGQMF RITTWGESHG KAVGVVVDGL PAGLPFSEAD IQKELDRRRP GQSEVSTPRH 
EADRVEILSG IFEGMSTGTP VSMLVWNSDA RSSAYDVIKD TPRPGHADFT YMARYGMRDH
RGGGRSSARE TIGRVAGGAL AKLLLSRFGI LIAGHVLELG ALRAKPLSFE EILENVEKTP
VRCADLEAAE KMLEKVAALR QEGDSIGGIV ELIIRGVPAG LGEPVFDRLD ADLAKALMSI
PAVKGFEIGA GFEAARLYGS EMNDPFRIKE GKITTSSNNA GGILGGISTG LDIVCRAAVK
PTPSIGKVQQ TVDLKTLENT EIAIKGRHDP TIPPRMVPVA EAMVALVIAD HMLRSGFINP
RTLLE