Gene Mboo_0739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0739 
Symbol 
ID5410997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp704308 
End bp705732 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content59% 
IMG OID640867960 
Productargininosuccinate lyase 
Protein accessionYP_001403900 
Protein GI154150282 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.730139 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.199189 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTCC TCTCATCGAT GCGATCCGAC CGCTGTATTG CCGATGCTGA TGTGCTGGTG 
GATATCGCCC ACGTGCTGAT GCTTGAACGC CAGAAGATCA TCAGCACCGA TACGGCAAAG
CAGCTCCTGC CGGCGCTCCT GGAACTCAAC AGCGAAGGTG TTCCCGAAGA GGTCTTTGAC
GACCGGTTCG AGGATGTCCA CGCAGGGATT GAGACCCTGC TTATCGGGGC TGTGGGCGCC
GATGTCGGTG GCCGCATGCA CATGGGGCGG TCGCGGAACG ACGAGGTGGC AACCTGCATA
CGGTTCAAGC TCCGGGAAGA CCTGCTCAAG CAACTGGCTG CACTTCTCAA GGTCCGCGAG
GTGCTGGTTA CCCTTGCCGA GCAGCACCGG GAATCGGTCA TGCCCGGTTT TACCCACATG
CAGCATGCCC AGCCCACCAC CCTTGCCCAT CATCTCCTTG CCTATGAGCA GCAGTTCACC
CGGGACTTTG ACCGGCTCCG CGATGCCTAT GCCCGGGTCA ACCTCTGCCC GCTCGGCGCT
GCTGCATTTG CCTCCACCGG CTACCCCATC GATCGCGAGT ATACTGCAAA ACTGCTCGCA
TTCGATGGCC CTGTTGTGAA CACAATGGAT GCGGTCGCCA CCCGTGACTT TGCCCTTGAA
ACCCTTGCCG ATCTCTCCAT TCTCATGGCG AACGTGAGCC GGCTCTGCGA GGAAATGGTT
ATCTGGAGTA CCTCGTTTGT GAAATTTGTG GTGCTCGATG ACGCGTTCTG CTCGACCTCG
TCGATCATGC CACAGAAGAA GAACCCGGAC ACCGCCGAGA TCATGCGGGC AAAGACCGGC
TCGGTCTTTG GGGCTTACAC GGGAGCGCTC ATGACGGTTA AGGGCCTGCC CATGAGCTAC
AACCGCGACC TCCAGGAGCT CACCCCCAAT ATCTGGCGCG GGATGCGGGA CGCAAAAGAG
AGCCTGCGCC TCCTCATCGA CATGCTTTCG AGCGCCACGT TCGACACCGA GCGGATGAAG
GAAGAGGCCG GCAAAGGTTT CTCCACCGCA ACAGAACTTG CCGACACCCT CGTTCGGGAC
TACGGCCTGC CTTTCCGCAT GGCCCACAAT ATCGTGGGCC GGGCCGTCCA GAAGGGGAGC
CTCTCCTTAG AGATGCTTGA AGCCGCGGCA AAGGAGCTTG ATGTCGGGAT CTCCTTTACC
GCAAAAGGGC TTACGCAGGA CCAGATTGAC AAGGCACTCG ATGTGACGCA CTCGGTGGAA
GTCCGGCGGG CGACCGGTGG GCCGGCTCCC TTTGCAACAA AGATTGCCAT CGCAGACCGG
AAGAAACTGC TCGATACCGA TTCGGTATTC ATTGACGAGC GACTGGCAAA GATCGCAAAG
GCCAAAGAGG ACCTGATATC AGATGCACGG AGGCTGGTAG CATAA
 
Protein sequence
MRFLSSMRSD RCIADADVLV DIAHVLMLER QKIISTDTAK QLLPALLELN SEGVPEEVFD 
DRFEDVHAGI ETLLIGAVGA DVGGRMHMGR SRNDEVATCI RFKLREDLLK QLAALLKVRE
VLVTLAEQHR ESVMPGFTHM QHAQPTTLAH HLLAYEQQFT RDFDRLRDAY ARVNLCPLGA
AAFASTGYPI DREYTAKLLA FDGPVVNTMD AVATRDFALE TLADLSILMA NVSRLCEEMV
IWSTSFVKFV VLDDAFCSTS SIMPQKKNPD TAEIMRAKTG SVFGAYTGAL MTVKGLPMSY
NRDLQELTPN IWRGMRDAKE SLRLLIDMLS SATFDTERMK EEAGKGFSTA TELADTLVRD
YGLPFRMAHN IVGRAVQKGS LSLEMLEAAA KELDVGISFT AKGLTQDQID KALDVTHSVE
VRRATGGPAP FATKIAIADR KKLLDTDSVF IDERLAKIAK AKEDLISDAR RLVA