Gene Mboo_0803 details

Gene Information

Locus tag: Mboo_0803
Symbol:
ID: 5410529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 772687
End bp: 773880
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 60%
IMG OID: 640868027
Product: citrate transporter
Protein accession: YP_001403964
Protein GI: 154150346
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases
TIGRFAM ID:
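
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are hypothetical.

```python
# Values copied from the Gene Information record above.
start_bp = 772687
end_bp = 773880

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1194
print(protein_length)  # 397
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.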


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.324785
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGCAGG TGGGCCGGTT CTCCTGGCGT ATCTGGCAGG TCATGCTCGG CGGAGCACTG 
GCAGTCCTCA TCCTTGGCCA GATCGCCCCT GCTGACGCCC TTGCCGCAAT CAACATCGAT
GTGATGGTCT TTCTCTTCGG CATGTTTGTC GTGGGAGAGG CACTCTCGCG GAGCGGGTAC
CTCGATCTTC TTGCCCGCCA GCTCTTCCGG CACGCCCGCA CACCGGGCCA ACTCCTCTTT
TTTGTGATCT TTGGTTTCGG GCTCCTCTCT GCACTGCTCA TGAACGATAC CCTTGCCATC
ATCGGCACAC CACTTGTACT CGGGCTTGCC ACCCGGTGCC GTCTCCCGGC AAAGCTCCTG
CTCCTTGCCC TTGCCTTTGC CATCACCACC GGGAGCGTGG CAAGCCCGAT AGGAAACCCG
CAGAACCTGC TCGTGGCCCT TGACAGCGGG ATGGGCGCAC CGTTTGTCAC CTTTGCCTCC
CACCTCCTCC TGCCAACGAT CCTGTCCCTT GCCGCTGCGT GGCTGATCCT TTTTTTCTTT
TACCGGAAAG GCTGGTCCAG TCTGCTGGAT AAAGAGGCCG GCGCTGACCC TGCGCCGGAG
TCGGACCCCG CCCTTTCCCG GATCGTGAAA TGCTCGCTTG CAGTCCTCCT CATTCTTTCC
GGCGCAAACA TTGCCGCATC ACTCCTGACC GGCGCGATTG CCCTCCCCCT GCCGCTCATT
GGGATTGCCG CTGCACTCCC TGTCATTCTC TTCTCCTCAC AACGGATCGC AGTGCTCAAA
TCCATCGACT GGTGCACGCT CGTTTTCTTT GCCGCGATGT TTGTCCTGAT GGCCGCAGTC
TGGGAGACCG GGTTTTTCCA GTCGCTCGCC GGCACCGCAG GAGTGACCTC GGTCCCCACA
ATCCTTGCGA CAAGCATTAT CCTCAGCCAG TTCATCTCGA ACGTACCTTT TGTCGCCCTC
TTTACGCCCC TCATCCTCCA GGCAGGGGGC GGGACCACCC GGCTCATGGC GCTTGCTGCC
GGGAGCACGA TTGCGGGCAA CGTTACGATC CTCGGTGCTG CAAGCAACGT GATCATCATC
CAGCAGGCCG AAAGCCGGGG GGAAACACTG ACGTTTATGG AATTTATGAA GATCGGCGTG
CCCCTGACAC TAATACAGGT GGGGATATAT GCGGTGTGCC TGGGGGTGGT GTAA
 
Protein sequence
MRQVGRFSWR IWQVMLGGAL AVLILGQIAP ADALAAINID VMVFLFGMFV VGEALSRSGY 
LDLLARQLFR HARTPGQLLF FVIFGFGLLS ALLMNDTLAI IGTPLVLGLA TRCRLPAKLL
LLALAFAITT GSVASPIGNP QNLLVALDSG MGAPFVTFAS HLLLPTILSL AAAWLILFFF
YRKGWSSLLD KEAGADPAPE SDPALSRIVK CSLAVLLILS GANIAASLLT GAIALPLPLI
GIAAALPVIL FSSQRIAVLK SIDWCTLVFF AAMFVLMAAV WETGFFQSLA GTAGVTSVPT
ILATSIILSQ FISNVPFVAL FTPLILQAGG GTTRLMALAA GSTIAGNVTI LGAASNVIII
QQAESRGETL TFMEFMKIGV PLTLIQVGIY AVCLGVV
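
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11, GTG is an accepted alternative start codon, and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. It is a minimal example and not the database's own pipeline: the function name `translate_cds` and the constants are hypothetical, and the start-codon set is the one commonly listed for table 11.

```python
from itertools import product

# Codon assignments in TCAG order (table 11 uses the same amino acid
# assignments as the standard genetic code).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: start codons as commonly listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the initiating codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a recognized start codon")
    protein = ["M"]  # an initiating codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGGCAGG TGGGCCGGTT CTCCTGGCGT"))  # MRQVGRFSWR
```

The output matches the first ten residues of the protein sequence listed above.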