Gene Athe_2135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2135 
Symbol 
ID7408844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2269447 
End bp2270574 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content36% 
IMG OID643716500 
Productprotein of unknown function DUF362 
Protein accessionYP_002573983 
Protein GI222530101 
COG category[S] Function unknown 
COG ID[COG2006] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGTAAAG TAGCAATTGT AAAGTGTGAT ACATATGAAG TGCAGGATGT AAAAGAGGCA 
ATTTTGAAGG GTATTAATCT AATAGGGTAT GAAATTCCAA AAAAAGACTG CGTGCTTGTA
AAACCTAATC TTCTTATGAA AAAAAACCCT GAGGATGCTG TTACCACTCA TCCTGCTGTT
GTTCAGGCAA CTGTGGAAAT CATTAAAGAA AAAGCAAGCA AGATTATAAT AGCTGACAGT
CCAGGTGGAC CGTATACCAA AAAGAGGCTT GAAAGTATAT ATCAAGCTGC AGGAATTAAA
GTGCTTGAAA AGCTTGAAAA TGTTTATTTG AACTATGATA CATCTTACAA AGACCTTAGT
ATTGAAACTA CAACTTTTAA AAAACTTTCG TTGATATCCC CATACTTTGA AAGCCAGGGG
ATCATAAATC TTCCAAAACT TAAAACCCAT CAGATGGCAG TCTATACAGG TGCTGTAAAG
AACTTATTTG GGCTTATTCC AGGTGGGCAG AAAGCTGAGA TGCATTTTAG ATTCCAAGAA
GTCAGAAGGT TTATGGAGAT GCTACTTGAA ATCTTAACGG TAGCAAAGCC AATGCTAAAT
ATTATGGACG GCATTGTTGC AATGGAAGGG GAAGGACCGT CAGCAGGAAA ACCCAAAGAG
CTTGGTATTT TGCTAATATC CGAGGATGCA ATTGCTTTGG ATTATGTGGC TTGTAAGATC
ATTGGGCTTG ACATAAAAGA TGTTCCTCTT TTGGAGGTGG CAAATGAAAA AGGACTTTTA
AATCCTGATA AGGTTGAGAT TGTTGGTGAG AGGATTGAAG ATGTTGCTCC ATCAAATTTT
GAACTTGTCT TAAGACCCGA GATATCTTTT GTGAAAGGAA GGCTTCCACA ATTTTTGGCA
AGGTTTTTGG ATAATTTTCT TTCACCAAGA CCAATTTTCG ACATAAATAT TTGTATAGGT
TGTGCAGAGT GTTTCAATGC ATGTCCTGCT CAGGCTATTG AGATGAGAAG CAGGAAAGCA
TATGTTGATT TGAAAAAGTG TATAAGGTGT TATTGCTGTC ATGAACTTTG TCCTTCAAAG
GCTATAAAAA TCAAAAGATC ATTTTTATTT GAGAAAGTAT TAAAATGA
 
Protein sequence
MCKVAIVKCD TYEVQDVKEA ILKGINLIGY EIPKKDCVLV KPNLLMKKNP EDAVTTHPAV 
VQATVEIIKE KASKIIIADS PGGPYTKKRL ESIYQAAGIK VLEKLENVYL NYDTSYKDLS
IETTTFKKLS LISPYFESQG IINLPKLKTH QMAVYTGAVK NLFGLIPGGQ KAEMHFRFQE
VRRFMEMLLE ILTVAKPMLN IMDGIVAMEG EGPSAGKPKE LGILLISEDA IALDYVACKI
IGLDIKDVPL LEVANEKGLL NPDKVEIVGE RIEDVAPSNF ELVLRPEISF VKGRLPQFLA
RFLDNFLSPR PIFDINICIG CAECFNACPA QAIEMRSRKA YVDLKKCIRC YCCHELCPSK
AIKIKRSFLF EKVLK