Gene Moth_2279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2279 
Symbol 
ID3831390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2388442 
End bp2389956 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content56% 
IMG OID637830199 
ProductSSS family solute/sodium (Na+) symporter 
Protein accessionYP_431109 
Protein GI83591100 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4145] Na+/panthothenate symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.257788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000218485 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGTTTT TTATTTTGGG CCTGTATTTT GCCGTGCTTT TGGCAATCGG TTTTTATAGC 
TTGAAGAAAT CGCGGGATGT CGGCGGCTTT TTCCTGGGTA ATCGTACCGT GGGCCCCTGG
GTATCGGCCT TTGCTTACGG GACGACTTAC TTTTCCGCCG TTATTTTTAT AGGTTATGCC
GGTAAAGTGG GGTGGGGTTT CGGCCTGTCC GACCTGTGGA TCGTCATCGG TAACGCCTTC
ATCGGCAGTT TCCTGGCCTG GAAGGTCCTG GCGCGGCCGA CCAGGGAAAT GACGGTTCGC
TTGAAGGCCA TGACCATGCC CGAGTTCCTG GCCGCACGTT ACGACAGCCC GGCTTTACGA
ACGTTGGGGG CGCTGGTAAT CTTTATTTTC CTGGTACCGT ATTCGGCTTC GGTTTATATG
GGTTTGAGTT ACCTCTTCGA GCAGGTTTTC CGCATTAACT TTACCACAGC CCTCATTGCC
ATGGCTGCCT TTACGGCCCT TTATCTGGTC CTGGGCGGCT ACATTGCCGT TACCCTGACT
GATTTCATCC AGGGTTTGAT TATGATTGGC GGCGTTGTGG TCCTGATATA TTATGTTATA
TCTGCGCCGC CTGTGGGCGG GCTGGCTGGC GGCATCAGCC GCCTGGCGGC CATCGACCCG
CGCCTGGTTA ACCCGTTGGG CCCCAACTGG TTCGCCCTCC TTTCCCTGGT AGTGCTCACC
AGCCTGGGAC CCTGGGGCCT GCCCCAGATG GTACAGAAGT TTTACGCTAT TAAAGATGAA
GGTTCCATCT GGCCGGCGAC GGTGGTTTCG ACCCTGTTCG CCCTGATTAT AGCTACCGGT
GCATATTTCA CCGGCGCCTT TGGCCGTCTC TTCTTCAACA ACCAGATGCC CTTGCTTAAC
GGCCAACCTA ACCCGGATCT CATTATGCCC CAGATTATCA ATCATTATCT GCCGCCCTGG
ATAGGCCTGC TCTTGTTGCT GCTGGTGCTG GCGGCCTCCA TGTCTACCCT GGCTTCCCTG
GTACTGGTAT CGAGTTCCGC GGTGGCTATT GACCTGGTCC AGGGCGGAGC TCCTGGGGTT
TCCCGCCGTA TAATCCTGGC CCTGCTGCGT TTCCTGTGTT TCTTCTTTAT TGGCCTTTCG
GTCTATATTG CCCTGAAGCC CACTATTATC CTGGTCCTCA TGTCCCTCTC CTGGGGCACG
GTGGCCGGAG CCTTCCTGGC CCCTTATATC TACGGCCTGT ACTGGCCCCG GACCACTAAA
GCCGGGGCCT GGGCGGGGCT CCTGAGCGGG CTGGCTATAT CCCTGGGCCT GTCCTTTTAC
TACCACCTGG ATGGCAGCGT CATCCCGACG ATAGGCTCCC TGGCCATGGT TATCCCTCTG
GGAGTGGTTC CCATGGTGAG CCTCGTCACG CCGGCCTTTT CCAGGGAACA TCTAGCTAAA
GTTTTTGGTG CTGATAGGAC AACTACTACG GCAACTAAAG GATTGGTGAC TGATGATCTG
GGATTCGGAA AATGA
 
Protein sequence
MKFFILGLYF AVLLAIGFYS LKKSRDVGGF FLGNRTVGPW VSAFAYGTTY FSAVIFIGYA 
GKVGWGFGLS DLWIVIGNAF IGSFLAWKVL ARPTREMTVR LKAMTMPEFL AARYDSPALR
TLGALVIFIF LVPYSASVYM GLSYLFEQVF RINFTTALIA MAAFTALYLV LGGYIAVTLT
DFIQGLIMIG GVVVLIYYVI SAPPVGGLAG GISRLAAIDP RLVNPLGPNW FALLSLVVLT
SLGPWGLPQM VQKFYAIKDE GSIWPATVVS TLFALIIATG AYFTGAFGRL FFNNQMPLLN
GQPNPDLIMP QIINHYLPPW IGLLLLLLVL AASMSTLASL VLVSSSAVAI DLVQGGAPGV
SRRIILALLR FLCFFFIGLS VYIALKPTII LVLMSLSWGT VAGAFLAPYI YGLYWPRTTK
AGAWAGLLSG LAISLGLSFY YHLDGSVIPT IGSLAMVIPL GVVPMVSLVT PAFSREHLAK
VFGADRTTTT ATKGLVTDDL GFGK