Gene Athe_1141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1141 
Symbol 
ID7408723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1236114 
End bp1237376 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content36% 
IMG OID643715507 
Productnuclease SbcCD, D subunit 
Protein accessionYP_002573015 
Protein GI222529133 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID[TIGR00619] exonuclease SbcD 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATTA GAGGAGTTCA TACAGCTGAC CTTCATTTTG GAGTGACAAC TTATAGCCGC 
GAAACTCCAG ACGGACTTGG CTCACGTGTA CATGACTTTT TCAAAACATT TGACAGAATA
TTGCAGTTTA TAAGAGATAA CAGCATTGAC TTTTTGCTTA TAACAGGCGA TATTTTTAAA
GACAGGGAAC CAAACTCTAC GCTGAGGAAT ATGTTTTACA AAAGGATTGT GGATATTTCG
AAAGAAGGTG TTTTGGTCAT TATAATTCCA GGCAATCATG ATATGCATCC ATTTGAGACA
AAAGACCATT CTGTAAAGGT CTTTGAGATA TTTGAACAGC CAAACATTGT TGTAATGGAC
AAACCTTTTG AAGTCAAAGA ATTTGAAATA AGAGATGAAA AGCTTCGCAT TATAGCTGTG
CCATACCTTT ATCTTGAAAG GTTTGTGGAT GAGACATTTC CTCAAAAGAC TGAAGAGCTT
GATATGATAG CAGCCAATTT TTTTGAGAAA AAACTAAGCC AGGTTTTAGA CTCTTCAGAA
GATAATATTC CAACAATACT TGCTGGGCAC TTTACTGTAG TAGAGGCGCA GATTGGAAGC
GAAAGATCAA TTATGCTTGG CAAAGACATA AAAGTGCCTC TTTCATGCCT TTTAAATCCA
AAGTTAAAAT TTGTGGCCCT TGGACATATT CACAAACCTC AGATTTTACA TGCAGCAAAC
CCTACTGTGC TGTATTGTGG GTCGCCTGAC AGAATAGATT TTTCTGAAGC AAACGACAGC
AAGGGGTTTG TTGTGTTTGA ATTAGATAAA GATAGCTTTA GGTTTGAGTT TCAACCTGTT
AAGGTAAGAC CTTTTTGCCA GTTGGAGATT GATGTGTTTG AAGACCAAGT AGAAAATCTC
ACCAAAAAGA TTCTTGACAA AATAGAAGAG AAAATACAAA TGTTTGAGCA AAGTACTTCA
AGTAGTATTC AGGTTTCGGT TGTAAAGCTC ATAATAAAAA CTCAGAGTTT GATAAAAGAG
AAGATTGACA TTGGGCTTGT TGAAAGGTTT TTGAGAGACA GATGTTTTGT TTTAGCGCCT
ATCGAAATTG AGGTAATTGA CTCAAAGAAA GATTTTAGAA TTGCTGAGGT TGATGAAAAG
TCGGACCCTG TTGAGGCATT CGAAAAGTTT TTATCTGCAA GCCAGAAATA CAGGGATGTA
GAAAATAAAG ATAAGATTGT ATCAGAATTT AAAAAACTTC TACATGAAAT CCAAGAAAAA
TAA
 
Protein sequence
MAIRGVHTAD LHFGVTTYSR ETPDGLGSRV HDFFKTFDRI LQFIRDNSID FLLITGDIFK 
DREPNSTLRN MFYKRIVDIS KEGVLVIIIP GNHDMHPFET KDHSVKVFEI FEQPNIVVMD
KPFEVKEFEI RDEKLRIIAV PYLYLERFVD ETFPQKTEEL DMIAANFFEK KLSQVLDSSE
DNIPTILAGH FTVVEAQIGS ERSIMLGKDI KVPLSCLLNP KLKFVALGHI HKPQILHAAN
PTVLYCGSPD RIDFSEANDS KGFVVFELDK DSFRFEFQPV KVRPFCQLEI DVFEDQVENL
TKKILDKIEE KIQMFEQSTS SSIQVSVVKL IIKTQSLIKE KIDIGLVERF LRDRCFVLAP
IEIEVIDSKK DFRIAEVDEK SDPVEAFEKF LSASQKYRDV ENKDKIVSEF KKLLHEIQEK