Gene Mboo_0003 details

Gene Information

Locus tag: Mboo_0003
Symbol:
ID: 5412086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2146
End bp: 3303
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 57%
IMG OID: 640867217
Product: aminotransferase, class I and II
Protein accession: YP_001403170
Protein GI: 154149552
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
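
The start, end, gene length, and protein length fields above are related by simple arithmetic. A minimal sketch in Python follows. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Neither convention is stated by the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2146, 3303)
print(length_bp)                  # 1158
print(protein_length(length_bp))  # 385
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.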


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAAACT TTGTCTCCGA CCGCGCCCGG GATATCCCGC CCTCAGGAAT ACGGAAATTC 
TTTGACCTTG CCCTCACGAT GAGTGATGTC ATCTCGCTGG GTGTCGGCGA ACCCGATTTC
CGGACCCCTT GGAACATTTG CGAAGCCGGC ATCTATTCGG TTGAACAGGG TGCAACCTCG
TATACCCCGA ACCGTGGGCT TCAGACCCTT CGCAGGGCAC TTGCCATACA CCTCGCAAAC
CGGTACCAGC TCCGGTACTC CCCGGACGAT GAGATGATCA TCACCACCGG GGTTTCGGAG
GGGCTTGATA TCGCCATCCG GGCCATTGTC AACCCCGGCG ATGAAGTGCT GATTGCTGAA
CCCAGCTATG TCTCCTATGC CCCATGCGTA GCCCTTGCCG GCGGGATTCC GGTTCCGGTC
GAATGCACCG AACAGGATCA CTTCCGTCTC AACCCGGATA AGCTTCAGGA AAAGATTACC
CCTAAGTCGA AGGCCCTCAT CGTCAATTTC CCCAACAATC CGACCGGCGC GATCATGAGA
AAGGAGGATC TGGAACCGAT TGCCGATATC GTTACTGACC GGGATCTGAT GCTTATCAGC
GACGAAGTCT ACTCGGAGCT CACGTACGAA TCCCCCCATG TTGCTGCGGC AACAGTAAAA
GACCTCCGGG AACGGACGAT CACGCTTAAC GGGTTCTCCA AGGCCTATGC AATGACCGGC
TGGCGTATCG GGTATCTCTG TGCACCAAAA GAGATCTGCG ATGCCGCGCT CAAGATCCAC
CAGTATGTCA TGCTCTGCGC TCCGGCGATG GGGCAGATCG GAGCGCTTGA GGCACTCCGC
TCCGCAGAAG ATGAGAAGAC AAGTATGATC AGCGAGTACC GCCTCCGGCG TAACTCGTTT
GTGGCCGGCC TCAACCGGAT CGGCCTCTCC TGCCACGTGC CCGAGGGTGC GTTTTATGCC
TTCCCCTCGG TTAAAGGTAC TGGGCTTTCC GATGTAGAAT TTGCAGAACG GTTGCTCCGG
GAACAGCGTG TTGCCGTAGT GCCGGGCAGC GTGTTTGGGG CCGGCGGGGA ATACCACCTC
CGCTGTGCGT ATGCAGTCTC ACGAGATGAG CTCACAGAAG CCTTGGGACG GATGGAGTCC
TTCATCAACG GTCTCTAA
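
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any length or composition check. The sketch below is one possible way to do that in Python. The helper names are hypothetical, and only the first line of the listing is used as input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (60 of the 1158 bases).
first_line = "ATGCGAAACT TTGTCTCCGA CCGCGCCCGG GATATCCCGC CCTCAGGAAT ACGGAAATTC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 3))  # 0.567
```

The value printed here covers only the first 60 bases. Passing the full listing to the same functions would give a figure to compare against the GC content field.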
 
Protein sequence
MRNFVSDRAR DIPPSGIRKF FDLALTMSDV ISLGVGEPDF RTPWNICEAG IYSVEQGATS 
YTPNRGLQTL RRALAIHLAN RYQLRYSPDD EMIITTGVSE GLDIAIRAIV NPGDEVLIAE
PSYVSYAPCV ALAGGIPVPV ECTEQDHFRL NPDKLQEKIT PKSKALIVNF PNNPTGAIMR
KEDLEPIADI VTDRDLMLIS DEVYSELTYE SPHVAAATVK DLRERTITLN GFSKAYAMTG
WRIGYLCAPK EICDAALKIH QYVMLCAPAM GQIGALEALR SAEDEKTSMI SEYRLRRNSF
VAGLNRIGLS CHVPEGAFYA FPSVKGTGLS DVEFAERLLR EQRVAVVPGS VFGAGGEYHL
RCAYAVSRDE LTEALGRMES FINGL
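
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The Python sketch below builds a codon table from the standard amino acid assignments. This relies on the assumption that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons. The `translate` helper is illustrative and is applied here only to the first 30 and last 18 bases of the listing.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon; trailing partial codons are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


print(translate("ATGCGAAACT TTGTCTCCGA CCGCGCCCGG"))  # MRNFVSDRAR
print(translate("TTCATCAACG GTCTCTAA"))               # FINGL*
```

Both fragments match the corresponding ends of the protein sequence, with the final `*` standing for the TAA stop codon that is not part of the 385 aa product.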