Gene Moth_1963 details


Gene Information

Locus tag: Moth_1963
Symbol:
ID: 3831145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2042783
End bp: 2044603
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 57%
IMG OID: 637829894
Product: carbon starvation protein CstA
Protein accession: YP_430804
Protein GI: 83590795
COG category: [T] Signal transduction mechanisms
COG ID: [COG1966] Carbon starvation protein, predicted membrane protein
TIGRFAM ID:
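
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, but both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2042783, 2044603)
print(length, protein_length(length))  # 1821 606
```

This gives 1821 bp and 606 aa, matching the Gene Length and Protein Length fields.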


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.929114
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCAC TGGTACTGTT AATTATAGCG GCGCTGGTAT TTGTCCTGGC CTACCGCTTT 
TACGGCGCCT TTATGGCCGC CAAGGTCCTG GCCCTGGACC CGGGGCACCA GACCCCGGCC
CTGATCCATA ACGACGGCCG GGACTATGTG CCGACCAACC GCTGGTTGGT TTTCGGCCAT
CACTTTGCAG CCATAGCCGG CGCCGGACCC CTCATCGGCC CGGTCCTGGC GGCCCAGTTC
GGCTACCTGC CCGGGTTCCT GTGGATCCTC ATCGGCGCCG TGGTCGCCGG CGCCGTCCAT
GATATGGTGA TCCTCTTCGC TTCCGTGCGC CATGACGGCC AGTCGCTGGC GGAAATAGCG
CGCAGCGAAG TGAGCAACTT TTCCTACTGG ATGGCTTCAA TCGCCCTTTT GTTTCTTTTA
ATTGTCGTCC TGGCCGGCGC CAGTGTCTCG GTAGTCAATG CCCTCTATCA GAGCCCCTGG
GGGACTTTTA CCGTAGGCGT GACTATCCCT ATCGCCATTT TTATCGGTGC TTATCTGAAA
TGGCTGCGCC CCGGCCGTAT CGGGGAGGCT ACTGTTATCG GCGTAGCCCT GATTGTCGCC
GGCGTCGTCC TGGGACCGGT CATCCAGCAT TCGTCCCTGG CTCCCTTACT AACCTTTGAT
AAACAACAGC TCTCCCTGCT CATCGCCGCC TACGGTTTTC TGGCGGCGGT GCTGCCGGTA
TGGCTCCTGC TGGTACCCAG GGACTACCTG AGCACTTATA TGAAAATCGG CACCATGTTA
TTGCTGGCCG TTGGCGTCAT TGCCGTCAAC CCCGTCCTGC AGATGCCTTC GGTGACTAAA
TTTGTGGCCG GCGGCGGCCC GGTCATTCCG GGCAAAGTTT GGCCCTTTAT GTTTATCACC
ATCGCCTGCG GGGCCCTCTC CGGTTTCCAC GCCATGGTCT CCAGCGGCAC TACGCCGAAA
ATGATCACCA GTGAGGCTGA CATCAAGGTT GTCGGTTACG GGGCCATGCT GGTGGAAGGC
TTTGTGGCCC TCATGGCCCT GATCGCGGCT ACCGTCCTGG CCCCGGCAGA CTACTTTGCC
ATTAACAGCG CCCCGGAGGT CTTTGCCAAA CTGGGTATGC ATGTCCAGGA CCTGCCCGTG
CTTTCCCAGC TGGTGGGAGA AAACCTGGCC GGCAGGCCGG GCGGCGCTGT ATCCCTGGCG
GCGGGCATGG CCCACATTTT CTCCAGCATC GGCGGTTTAA GGCACCTGAT GGGTTACTGG
TACCATTTCG CCATTATGTT TGAAGCCCTG TTCATTCTGA CGTTGATTGA CGCCGGTACC
CGGGTCGGGC GCTACCTGCT GCAGGAAATT GGTGGGGTAA TCTACAAACC TTTGAAAGAC
ACCAATTGGT GGCCGGGTAT TATCCTCACC AGTGGTATCT TTACTCTAGC CTGGGGTTAT
CTCCTTTACG GAGGTACCAT ATCCACCATC TGGCCCCTTT TCGGGGTAAA CAACCAGCTC
CTGGGAAGCA TGGCCCTGGC CATCGGCACC ACCATGCTCA TCAGGATGGG CAAGGTCCGC
TACGCCTGGA CGACCTTTAT CCCTATGGTC TTTTTGACGG TAACTACTAT AACCGCAGGT
TATCAAAATA TCTTTATAAA CTATCTACCG GCCCATAATT ACCTGCTGGC AGTAATTTCC
ATCATTATGC TTCTGATGGT CATCGCTATC ATTATCGACT CTGTAAGGGT GTGGTTCCAG
CTCCTCTCCG GGAGCAAAAC GGAACTGGAA AAGGGCCGGG CTGCTTCATT AAACGAAACC
GGTTCGGCCC ACACTTATTA A
 
Protein sequence
MNALVLLIIA ALVFVLAYRF YGAFMAAKVL ALDPGHQTPA LIHNDGRDYV PTNRWLVFGH 
HFAAIAGAGP LIGPVLAAQF GYLPGFLWIL IGAVVAGAVH DMVILFASVR HDGQSLAEIA
RSEVSNFSYW MASIALLFLL IVVLAGASVS VVNALYQSPW GTFTVGVTIP IAIFIGAYLK
WLRPGRIGEA TVIGVALIVA GVVLGPVIQH SSLAPLLTFD KQQLSLLIAA YGFLAAVLPV
WLLLVPRDYL STYMKIGTML LLAVGVIAVN PVLQMPSVTK FVAGGGPVIP GKVWPFMFIT
IACGALSGFH AMVSSGTTPK MITSEADIKV VGYGAMLVEG FVALMALIAA TVLAPADYFA
INSAPEVFAK LGMHVQDLPV LSQLVGENLA GRPGGAVSLA AGMAHIFSSI GGLRHLMGYW
YHFAIMFEAL FILTLIDAGT RVGRYLLQEI GGVIYKPLKD TNWWPGIILT SGIFTLAWGY
LLYGGTISTI WPLFGVNNQL LGSMALAIGT TMLIRMGKVR YAWTTFIPMV FLTVTTITAG
YQNIFINYLP AHNYLLAVIS IIMLLMVIAI IIDSVRVWFQ LLSGSKTELE KGRAASLNET
GSAHTY
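
The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any computation. The following is a minimal sketch, not the database's own method: it strips the whitespace, computes a GC fraction, and translates a nucleotide string. The helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), so the sketch uses the standard assignments and does not handle alternative start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = clean_sequence(
    "ATGAATGCAC TGGTACTGTT AATTATAGCG GCGCTGGTAT TTGTCCTGGC CTACCGCTTT"
)
print(translate(first_line))  # MNALVLLIIAALVFVLAYRF
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field.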