Gene Mboo_2111 details


Gene Information

Locus tag: Mboo_2111
Symbol:
ID: 5411209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2186100
End bp: 2187041
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 59%
IMG OID: 640869356
Product: ABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein
Protein accession: YP_001405268
Protein GI: 154151650
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
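
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2186100
end_bp = 2187041

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 942 313
```

Under these assumptions the result agrees with the reported Gene Length (942 bp) and Protein Length (313 aa).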


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGATG AGACCATCAG GATCGGCCAT CTCTCCACCC TTTACCATAC TGCGTTCCTG 
CTCCGCGGCT CGGATCTTCT TGCACAACGG GGTGTCCGTG CAACCTGGTC GCTCTTCCCC
TCGGGGCCGG ATATCATCAG TGCAATGCAG GCAGGGCGGC TTGACCTTGG GTATATCGGT
ATGCCACCGG TCATCATAGG GATCGACCGG GGGCTGGAAC TTGCCTGTAT CGCCGGCGGC
CATATCGAAG GAACGGTCAT GATTGCGGAC AGTACGATCC GGACCCTTGA TGAATGCGGC
AGCATGCAGG CGTTCTTTTC CCAGCTTGCA GGAAAAGCGA TCGGGACACC CCCGAAAGGC
TCCATTCATG ATGTGATCGT TACCGATCTG CTGGAAAAGA ACAGGAGGCC GGACATCTCC
GTGCGCAACT ATCCCTGGGC AGACTTCCTC TCCGATGCAC TCGTACAGAA GGAGATTGCC
GCTGCCGCCG GTACCCCGGC GCTTGCAACA ACTGCCCGGA CGTACGGGAA TGGCAGGATC
GTGATCCCGC CGGACCGGCT CTGGCCGTTC AATCCCAGCT ATGGCATCGT GGTGATGCGC
AGGATGCTTA AAAATCGCGA TCTCCTTACC CGGTTTTTAA CCGCCCATGA GGCTGCATGC
GAGTGGATCC GCAGTGACCC GGCTGCATGT GCACGGATCG TGGCAGGGAC AACCGGGATG
GTGGACCCAG GTTTTGTTCT TGAAACCTAC CGGATCTCAC CGAAATACTG CGCGGCGCTG
CCGCCGGAGT ATATCGCGTC CACCATGAAG TTCGCACAAA CGCTTCATAC CCTCGGGTAT
ATTTCCCGCC TGATCCGCGA GGACGAGTGC TTTGAGCGGT CACTGATAGA AATCGTCCAC
CCGGGACCCC ACCATTACGC TGACGGGATC GCAGACGCGT GA
 
Protein sequence
MPDETIRIGH LSTLYHTAFL LRGSDLLAQR GVRATWSLFP SGPDIISAMQ AGRLDLGYIG 
MPPVIIGIDR GLELACIAGG HIEGTVMIAD STIRTLDECG SMQAFFSQLA GKAIGTPPKG
SIHDVIVTDL LEKNRRPDIS VRNYPWADFL SDALVQKEIA AAAGTPALAT TARTYGNGRI
VIPPDRLWPF NPSYGIVVMR RMLKNRDLLT RFLTAHEAAC EWIRSDPAAC ARIVAGTTGM
VDPGFVLETY RISPKYCAAL PPEYIASTMK FAQTLHTLGY ISRLIREDEC FERSLIEIVH
PGPHHYADGI ADA
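
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method used by the database: the `translate` function and the table construction are hypothetical helpers, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). Whitespace in the input is ignored so that the 10-base blocks shown above can be pasted in directly.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Amino-acid assignments are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGCCGGATG AGACCATCAG GATCGGCCAT"))  # prints: MPDETIRIGH
```

The printed fragment matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should reproduce the full protein, provided the sequence was copied without errors.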