Gene Moth_1588 details

Gene Information

Locus tag: Moth_1588
Symbol:
ID: 3832734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1621898
End bp: 1623517
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 57%
IMG OID: 637829517
Product: Cl- channel, voltage gated
Protein accession: YP_430437
Protein GI: 83590428
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0038] Chloride channel protein EriC
TIGRFAM ID:
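
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and do not come from the database.

```python
# Values copied from the record above.
START_BP = 1621898
END_BP = 1623517


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1620 539
```

Both results agree with the Gene Length and Protein Length fields of the record.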


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000108225
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.782392
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGTG TGGAAACTGT ACTGCAGGAG CATTCGGTAC CAGAGGGTAT ACTATTAAGA 
AGATATAGTT ACCTTCTTCG CTTTTTATGT ACGGGAGCCC TGGCGGGAGC GGGCGCCGGA
CTGGTGGGCG CGGCCTTCAG GCTGGCCCTG ACGGAAGGGG ACTTATGGCG TAATAGCCTC
TTGACATGGG CCAAGGGAAT ACCGCTGTGG GGCTGGCTGG CACTGCCGTT CCTGGGTGCC
CTGGCGGGAG GCCTGGCCGG CTGGTTGACC AGCCTGGCGC CGGAAACAGC CGGGAGCGGT
ATCCCCCATG TGGAAGCAGT TTTAATTAAC CTGCGCCGGT TGGTATGGTG GCGGGTCATA
CCTGTAAAAT TTATAGCCGG GGCCCTGGCC ATCGGGGCCG GGTTATCCCT GGGACCGGAA
GGCCCGGCGG TCCAGATGGG GGCGGCCGCC GGCAAGGCGG TAAGTGACGG TTTTGGCCGC
TCGAAAACCG AGGAACTGCA CCTTATCGCC TGTGGTGCCG GGGCCGGCCT GGCAGCGGCC
TTTAATGCCC CCCTGGCGGG AGTAGTCTTT GTTCTGGAAG AACTAAGACG CAACTTCTCT
CCCTATGCCC TGGGTGGAGC CCTGGTAGCC TCCGTTGTGG CGGACATGGT ATCCCGGCAT
ATCCTGGGAC CTTTACCTAC TTTCCGGTTA ATTGAAGCCT GGCCGGTATT ACCATTAACT
ACCCTGCCGG TATTCCTGGT TTTAGGGGTG CTGGCGGGTA TCCTGGGGGC CGTCTTTAAC
TGGTCCCTCT TAGCCAGTCT CGAACTGGGG GATAGATTAA ACAGGTTCCC TCGCTGGTTG
CGGGCCCTGC TGATCCTTTT TCTAGCAGGT ATCCTGGGCT ATTTTTTACC AGAGGTTCTG
GGGGGCGGGC ACCTGCTGGC GGAAGAAGCC CTGGCCGGAA AGGTCGCCTG GACCCTCATC
CCCCTTCTCT TTGTGGTCAA GTTTCTCTTG ACTATGATTA GTAACAGCGC CGGCGTTCCT
GGGGGCATCT TTTTGCCCCT GCTGGTACTG GGGGCTCTAT TGGGTTCCCT CGTGGGACAG
GTAAGCGGGT TGCTAATCCC CGCCTTTCAA GGCATGGTCC CGGCTTTTGC CATGATCGGT
ATGGCTGCCT ATTTTGTCGC CATTCTGCGC TTACCCCTCA CTGGGGTAGT TTTAATAATC
GAGATGACTG GCAGCTACCG GCATATAGTG TTGCTCCTCT TTACCTGTAT GATTGCTTAC
CTGGTGGTAG AGACCCTGGG GAGCAGACCA GCCTATGAAA TGCTGTTGGA GCGTGACCTG
GCCAGGGCCA GGGTAGAGGC TGAACCATCT CCGGTTGGCA AGATGTTAAT GCTGGACTTT
GCCGTTGAGG CTGGCTCTGA TGCCTGCGGT CGCCTGGTAA GGGACCTGGA ACTGCCTCCG
GATTGCCTGC TGGTTACTAT CCGCCGCAAG GGCAGGGAAA TAATTCCGCG TGGTAATACC
AGCATTCAGG AAGGAGATCA CCTGGCGGTG ATTACCCCTG AAGAACGGGC GGCAGAAATC
TGCCATGAAT TATCAGGGGT AACCCGTTGT AAGTTCCAGC AGAAATTGCA ACGATTCTGA
 
Protein sequence
MARVETVLQE HSVPEGILLR RYSYLLRFLC TGALAGAGAG LVGAAFRLAL TEGDLWRNSL 
LTWAKGIPLW GWLALPFLGA LAGGLAGWLT SLAPETAGSG IPHVEAVLIN LRRLVWWRVI
PVKFIAGALA IGAGLSLGPE GPAVQMGAAA GKAVSDGFGR SKTEELHLIA CGAGAGLAAA
FNAPLAGVVF VLEELRRNFS PYALGGALVA SVVADMVSRH ILGPLPTFRL IEAWPVLPLT
TLPVFLVLGV LAGILGAVFN WSLLASLELG DRLNRFPRWL RALLILFLAG ILGYFLPEVL
GGGHLLAEEA LAGKVAWTLI PLLFVVKFLL TMISNSAGVP GGIFLPLLVL GALLGSLVGQ
VSGLLIPAFQ GMVPAFAMIG MAAYFVAILR LPLTGVVLII EMTGSYRHIV LLLFTCMIAY
LVVETLGSRP AYEMLLERDL ARARVEAEPS PVGKMLMLDF AVEAGSDACG RLVRDLELPP
DCLLVTIRRK GREIIPRGNT SIQEGDHLAV ITPEERAAEI CHELSGVTRC KFQQKLQRF
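
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a hedged sketch rather than the database's own method: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by the
# standard code and translation table 11 ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGCACGTG TGGAAACTGT ACTGCAGGAG"))  # prints: MARVETVLQE
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein Length and GC content fields to be checked in the same way.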