Gene Athe_1063 details

Gene Information

Locus tag: Athe_1063
Symbol:
ID: 7409620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1158451
End bp: 1160121
Gene length: 1671 bp
Protein length: 556 aa
Translation table: 11
GC content: 38%
IMG OID: 643715429
Product: transcription termination factor Rho
Protein accession: YP_002572937
Protein GI: 222529055
COG category: [K] Transcription
COG ID: [COG1158] Transcription termination factor
TIGRFAM ID: [TIGR00767] transcription termination factor Rho
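
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1158451, 1160121)
print(length_bp)                  # 1671
print(protein_length(length_bp))  # 556
```

Both results agree with the gene length (1671 bp) and protein length (556 aa) given in the record.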


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000384674
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCTGGGT CACTTGATGA GTTTTTAAAA GGAAAGTCTA TTATTGAACT TAGAGAAATA 
GCAAAAAGTC TGGGTATTCA GAAATATTCT TTGCTCAAAA AAGGTGAATT GATGGAAGCT
ATTAGAAATT TTCTGGGAAG TTCAAAAGAA GATGCCACAG TTCTTGAGGG CATAGGCAAA
AAAGAAAAAG GAAGAAGAGG AAGAAAAAAG AAATCAGAAA TTGCAGAAGT TCAGCAGACT
TTTGAATTAA AGAGCGAGGA GCTACAATCA AAGGTTGGAG AGACCAATGA GAAGGAAGAG
GAAAGACCAA GTGAAATGGA AAATAAAGAA GAAGATAACA TAAAAGAACA GACTTTGTCT
GAGGATATAC AAAAAGAAGA AGAAAAAACT GAAATGCCAA CAGATGTTAG TTTAAATGAG
GAGAAAAAAG AGGAGAGTAA AGTCATAGAG TTAAAACCTA AAAAAGAAGA AAAAAGAGAG
GATAAAATGC AGATAGAAAT TCCGCCAGAA TTAAAAGAAC TTGAAGGTAA AGTGGAGATA
GGCGGCATAG GCGAAGGAGT TTTGGAGATA ATTTATGAGC CTGGTGGCGG TGGCGGTTAC
GGATTTTTGC GCGACGATTC GTTCGTTCCT GGCCCAAACG ACATATATGT TTCGCCATCT
CAAATCAGAA AATTCAATCT CAAAACCGGA GATAAAATTA GAGGTCCCAT TAGGCTTCCG
AAAGAAAATG AGAAGTTTGC AGGGCTTTTG TATGTTCAGA GCGTCAACGA TATGAAACCA
GAAGAAGTTG CAAAACGCAC TCCTTTTGAA GACCTTACCC CTATTTTCCC AAACAAAAGA
ATAATTTTGG AAAATAAAAA TGAGCCTAAG GATTTAGCAG TTAGACTCAT AGACCTTATT
GCGCCAATTG GAAGAGGACA GAGAGGATTA ATTGTAGCAC CACCAAAAGC AGGTAAAACT
ACGCTATTAA AGAAAATAGC AAATAGTATT CTGACAAACT ATGATGATTT GCATTTGATT
GTACTGCTCA TTGATGAAAG ACCCGAAGAG GTCACTGACA TGCAAGATTC AATAAAAGCA
GAGATACATT ACTCTACATT TGATGAAACT CCTGAACACC ACATAAAAGT TGCTGAAATG
GTTTTAGAGA GGGCTATGAG ACTTGTTGAG TGTAAAAAAG ATGTTGTCAT TTTGTTAGAT
AGCTTGACAA GGCTTGCACG TGCTTATAAC TTAGTTGAAC CACCTTCTGG CAGAACACTT
TCTGGCGGTC TTGACCCGAA CGCTCTTCAT AAACCTAAAA AGTTTTTTGG TGCTGCAAGA
AACCTTAAAG AAGGTGGTAG TCTTACTATC CTTGCAACAG CGTTGATTGA AACAGGGTCA
CGAATGGATG ATGTCATATT TGAAGAGTTC AAGGGTACTG GCAACATGGA GTTGCACCTT
GACAGAAAAC TTTCTGAAAA ACGAATATTC CCGGCTATTG ATATAAACAA GTCAGGGACG
CGAAGAGAGG AACTTTTGCT GTCTGAAGAG GAAAAAGCAG CTGTTGATGC TATCAGAAGA
GCACTTTCTA ACTTTGGAAC AGCTGAAACT ACAGAGAGAA TTATAAGTAT GCTTTCCCAA
ACAAAGTCAA ATGAAGAATT CATAAGAAAG ATATTACAAA ATTTAAGATA A
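
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch of that cleanup and of a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first line of the sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a small example input.
first_line = "GTGCCTGGGT CACTTGATGA GTTTTTAAAA GGAAAGTCTA TTATTGAACT TAGAGAAATA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Run on the full 1671 bp sequence, the same function should give a value close to the 38% GC content listed in the record, assuming that figure was computed over the whole gene.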
 
Protein sequence
MPGSLDEFLK GKSIIELREI AKSLGIQKYS LLKKGELMEA IRNFLGSSKE DATVLEGIGK 
KEKGRRGRKK KSEIAEVQQT FELKSEELQS KVGETNEKEE ERPSEMENKE EDNIKEQTLS
EDIQKEEEKT EMPTDVSLNE EKKEESKVIE LKPKKEEKRE DKMQIEIPPE LKELEGKVEI
GGIGEGVLEI IYEPGGGGGY GFLRDDSFVP GPNDIYVSPS QIRKFNLKTG DKIRGPIRLP
KENEKFAGLL YVQSVNDMKP EEVAKRTPFE DLTPIFPNKR IILENKNEPK DLAVRLIDLI
APIGRGQRGL IVAPPKAGKT TLLKKIANSI LTNYDDLHLI VLLIDERPEE VTDMQDSIKA
EIHYSTFDET PEHHIKVAEM VLERAMRLVE CKKDVVILLD SLTRLARAYN LVEPPSGRTL
SGGLDPNALH KPKKFFGAAR NLKEGGSLTI LATALIETGS RMDDVIFEEF KGTGNMELHL
DRKLSEKRIF PAIDINKSGT RREELLLSEE EKAAVDAIRR ALSNFGTAET TERIISMLSQ
TKSNEEFIRK ILQNLR
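
The record lists translation table 11, and the gene begins with GTG while the protein begins with M. The sketch below shows one way to reproduce that, assuming the standard codon assignments and the table 11 set of alternative start codons, each read as methionine at position 1. `translate_cds` is a hypothetical helper written for this illustration, not a library function.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for translation table 11; read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the gene sequence.
print(translate_cds("GTGCCTGGGT CACTTGATGA G"))  # MPGSLDE
```

The output matches the first seven residues of the protein sequence, MPGSLDE.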