Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1963 |
Symbol | |
ID | 3831145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2042783 |
End bp | 2044603 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637829894 |
Product | carbon starvation protein CstA |
Protein accession | YP_430804 |
Protein GI | 83590795 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG1966] Carbon starvation protein, predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.929114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCAC TGGTACTGTT AATTATAGCG GCGCTGGTAT TTGTCCTGGC CTACCGCTTT TACGGCGCCT TTATGGCCGC CAAGGTCCTG GCCCTGGACC CGGGGCACCA GACCCCGGCC CTGATCCATA ACGACGGCCG GGACTATGTG CCGACCAACC GCTGGTTGGT TTTCGGCCAT CACTTTGCAG CCATAGCCGG CGCCGGACCC CTCATCGGCC CGGTCCTGGC GGCCCAGTTC GGCTACCTGC CCGGGTTCCT GTGGATCCTC ATCGGCGCCG TGGTCGCCGG CGCCGTCCAT GATATGGTGA TCCTCTTCGC TTCCGTGCGC CATGACGGCC AGTCGCTGGC GGAAATAGCG CGCAGCGAAG TGAGCAACTT TTCCTACTGG ATGGCTTCAA TCGCCCTTTT GTTTCTTTTA ATTGTCGTCC TGGCCGGCGC CAGTGTCTCG GTAGTCAATG CCCTCTATCA GAGCCCCTGG GGGACTTTTA CCGTAGGCGT GACTATCCCT ATCGCCATTT TTATCGGTGC TTATCTGAAA TGGCTGCGCC CCGGCCGTAT CGGGGAGGCT ACTGTTATCG GCGTAGCCCT GATTGTCGCC GGCGTCGTCC TGGGACCGGT CATCCAGCAT TCGTCCCTGG CTCCCTTACT AACCTTTGAT AAACAACAGC TCTCCCTGCT CATCGCCGCC TACGGTTTTC TGGCGGCGGT GCTGCCGGTA TGGCTCCTGC TGGTACCCAG GGACTACCTG AGCACTTATA TGAAAATCGG CACCATGTTA TTGCTGGCCG TTGGCGTCAT TGCCGTCAAC CCCGTCCTGC AGATGCCTTC GGTGACTAAA TTTGTGGCCG GCGGCGGCCC GGTCATTCCG GGCAAAGTTT GGCCCTTTAT GTTTATCACC ATCGCCTGCG GGGCCCTCTC CGGTTTCCAC GCCATGGTCT CCAGCGGCAC TACGCCGAAA ATGATCACCA GTGAGGCTGA CATCAAGGTT GTCGGTTACG GGGCCATGCT GGTGGAAGGC TTTGTGGCCC TCATGGCCCT GATCGCGGCT ACCGTCCTGG CCCCGGCAGA CTACTTTGCC ATTAACAGCG CCCCGGAGGT CTTTGCCAAA CTGGGTATGC ATGTCCAGGA CCTGCCCGTG CTTTCCCAGC TGGTGGGAGA AAACCTGGCC GGCAGGCCGG GCGGCGCTGT ATCCCTGGCG GCGGGCATGG CCCACATTTT CTCCAGCATC GGCGGTTTAA GGCACCTGAT GGGTTACTGG TACCATTTCG CCATTATGTT TGAAGCCCTG TTCATTCTGA CGTTGATTGA CGCCGGTACC CGGGTCGGGC GCTACCTGCT GCAGGAAATT GGTGGGGTAA TCTACAAACC TTTGAAAGAC ACCAATTGGT GGCCGGGTAT TATCCTCACC AGTGGTATCT TTACTCTAGC CTGGGGTTAT CTCCTTTACG GAGGTACCAT ATCCACCATC TGGCCCCTTT TCGGGGTAAA CAACCAGCTC CTGGGAAGCA TGGCCCTGGC CATCGGCACC ACCATGCTCA TCAGGATGGG CAAGGTCCGC TACGCCTGGA CGACCTTTAT CCCTATGGTC TTTTTGACGG TAACTACTAT AACCGCAGGT TATCAAAATA TCTTTATAAA CTATCTACCG GCCCATAATT ACCTGCTGGC AGTAATTTCC ATCATTATGC TTCTGATGGT CATCGCTATC ATTATCGACT CTGTAAGGGT GTGGTTCCAG CTCCTCTCCG GGAGCAAAAC GGAACTGGAA AAGGGCCGGG CTGCTTCATT AAACGAAACC GGTTCGGCCC ACACTTATTA A
|
Protein sequence | MNALVLLIIA ALVFVLAYRF YGAFMAAKVL ALDPGHQTPA LIHNDGRDYV PTNRWLVFGH HFAAIAGAGP LIGPVLAAQF GYLPGFLWIL IGAVVAGAVH DMVILFASVR HDGQSLAEIA RSEVSNFSYW MASIALLFLL IVVLAGASVS VVNALYQSPW GTFTVGVTIP IAIFIGAYLK WLRPGRIGEA TVIGVALIVA GVVLGPVIQH SSLAPLLTFD KQQLSLLIAA YGFLAAVLPV WLLLVPRDYL STYMKIGTML LLAVGVIAVN PVLQMPSVTK FVAGGGPVIP GKVWPFMFIT IACGALSGFH AMVSSGTTPK MITSEADIKV VGYGAMLVEG FVALMALIAA TVLAPADYFA INSAPEVFAK LGMHVQDLPV LSQLVGENLA GRPGGAVSLA AGMAHIFSSI GGLRHLMGYW YHFAIMFEAL FILTLIDAGT RVGRYLLQEI GGVIYKPLKD TNWWPGIILT SGIFTLAWGY LLYGGTISTI WPLFGVNNQL LGSMALAIGT TMLIRMGKVR YAWTTFIPMV FLTVTTITAG YQNIFINYLP AHNYLLAVIS IIMLLMVIAI IIDSVRVWFQ LLSGSKTELE KGRAASLNET GSAHTY
|
| |