Gene Information |
Locus tag | Athe_2601 |
Symbol | |
ID | 7409560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2739216 |
End bp | 2740691 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643716970 |
Product | stage IV sporulation protein A |
Protein accession | YP_002574439 |
Protein GI | 222530557 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02836] stage IV sporulation protein A |
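
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Athe_2601.
length_bp = gene_length(2739216, 2740691)
print(length_bp, protein_length(length_bp))  # prints "1476 491"
```

Both results match the Gene Length (1476 bp) and Protein Length (491 aa) fields of the record.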
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000143605 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGCTG ATATCTACAG AGAAATAGCC AAAAGGACAA ACGGTGACAT TTACATTGGC GTGGTTGGAC CTGTCAGAAC AGGAAAGTCC ACATTTATAA AAAAATTCAT GGACCTTTTT GTAATTCCTA ACATTGAAGA TGAGTATAAA AAAGAGAGAA CAAAGGATGA ACTACCTCAA AGCGCACAGG GAAAAACCAT AATGACAACA GAGCCCAAGT TTGTTCCCAA TGAAGCTGTT GAAGTTTTGC TATCAAGCGG TGCAAGACTG AAGGTACGGC TTGTTGACTG CGTTGGGTAC CTTGTTGAAG GTGCAATGGG ACATTTGGAA GAAAACCACC CGCGCATGGT CACAACACCG TGGTTCGAAA AGCCAATCCC GTTTGAAGAG GCAGCAGAGA TTGGCACTAA AAAGGTCATC CAGGACCACT CAACGATTGG GGTAGTCATT ACAACAGATG GCACAATTAC TGATATACCA CGAGAAAACT ACATAAAGGC AGAAGAGAGA GTGATTGAAG AATTAAAACA AATCAACAAA CCATTTGTGA TTGTTCTCAA CACCGCAAAA CCCTACTCAC CCGACACACA AGAGCTCAAA AAAGATCTTG AAGAGAAGTA CAAAATGCCA GTTTTGATTG TCAACTGTCT TCAGATGCAG ATTGAGGATG TAAAGAGGAT TTTAGAGACA GTACTGTTTG AGTTTCCTAT TGTTGAAGTA AAGATTAACC TGCCCAGATG GTTTGACGAG ATGGAAGATG AGTCGTGGCT CAAAAAAGAG ATTTATGAAA AGATAAAAGA ATATGCAGAA AAACTTAACA AAATCAGAGA TATAACAGAC CAGCTTGAAG TTTTAAAACA ACTTCCCCAG ATTGACAGGT GTGAGGTTGT AGGAATAAAT TTGGGAGATG GGAAGAGCGA GCTTTCAATT TATTTCAAAG AAGGACTTTT GTTCAGGATT ATTGAGGAGT TCACTGGATT TGAAATAAAA GGCGATCATC ACTTATTAAA ACTTCTTTCT GAACTTGCAC AGGTGAAGAA AGACTATGAT AAACTCAAAG ATGCGCTTGA AGAAGCAGGT GAAAAAGGTT ATGGTATTGT ACCACCATCC TTAAACGAAC TTAAGCTTGA AACTCCTGAG ATAGTCAAAA GAGGAAATAG TTTTGGTGTA AGACTAAAGG CATCAGCTCC CTCTCTTCAC ATTATAAGAG TTGAGGTTGA AACAGAGGTA TCACCAATTG TTGGAACAGA AAAACAAAGT GAAGAGCTTG TAAACTTTTT GATGAGAGAG TTTGAAGATG ACCCAAAAAA GATTTGGGAG TCCAATATAT TTGGAAAATC TCTGCACGAG CTTGTTAAGG AAGGCTTGCA AAACAAGCTA TACAGAATGC CAGAAGATGC TCAAGCAAAG TTGAAAGAAA CATTGCAAAA AATTATAAAT GAGGGAAGTG GCGGGCTTAT TTGTATTATA CTTTAA
|
Protein sequence | MDADIYREIA KRTNGDIYIG VVGPVRTGKS TFIKKFMDLF VIPNIEDEYK KERTKDELPQ SAQGKTIMTT EPKFVPNEAV EVLLSSGARL KVRLVDCVGY LVEGAMGHLE ENHPRMVTTP WFEKPIPFEE AAEIGTKKVI QDHSTIGVVI TTDGTITDIP RENYIKAEER VIEELKQINK PFVIVLNTAK PYSPDTQELK KDLEEKYKMP VLIVNCLQMQ IEDVKRILET VLFEFPIVEV KINLPRWFDE MEDESWLKKE IYEKIKEYAE KLNKIRDITD QLEVLKQLPQ IDRCEVVGIN LGDGKSELSI YFKEGLLFRI IEEFTGFEIK GDHHLLKLLS ELAQVKKDYD KLKDALEEAG EKGYGIVPPS LNELKLETPE IVKRGNSFGV RLKASAPSLH IIRVEVETEV SPIVGTEKQS EELVNFLMRE FEDDPKKIWE SNIFGKSLHE LVKEGLQNKL YRMPEDAQAK LKETLQKIIN EGSGGLICII L
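
The relationship between the two sequences can be illustrated with a minimal translation sketch. It assumes that the gene sequence is shown 5' to 3' on the coding strand (the record lists the gene on the minus strand, so this is an assumption about how the page displays it), and it uses the amino acid assignments of translation table 11, which match the standard code. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
# Codon table built from the conventional compact encoding (base order T, C, A, G).
# Translation table 11 uses the same amino acid assignments as the standard code;
# it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string written in space-separated blocks.

    Whitespace is removed first; translation stops at the first stop codon,
    and a trailing incomplete codon is ignored. Alternative start codons
    are not given special treatment in this sketch.
    """
    dna = "".join(blocked_dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two 10-nt blocks of the gene sequence from the record.
print(translate("ATGGATGCTG ATATCTACAG"))  # prints "MDADIY"
```

The output agrees with the first six residues of the protein sequence (MDADIYREIA ...). Passing the full gene sequence to `translate` in the same way would be one way to compare the whole translation against the listed protein.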