Gene Athe_1921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1921 
Symbol 
ID7407334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2026225 
End bp2028483 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content34% 
IMG OID643716293 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_002573782 
Protein GI222529900 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00848355 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGAA AGGCGTTATT TGTTGCCAGT TTTATGATGA TAGGAATTGT TCTGGGCAGA 
AATATAAAAA AAATTGAGGT ACTTGTATTT TGCCTTTTGC TAATTTTAGG AGCGCTGTGT
GCTACCTACT ATTTTCTTCC TCAGTACTTT AAAAAGGAAA AGTTTATGTT TATTTTATGT
TTTCTTTTTC TCACGCTGCA GCTTTTCAGG ACTTATTACA TTTTCAATAT TCTTGAGCCT
CAAAAAAATC TGGATGGGAA ACATGTTTAT ATTGTTGGCA ATATCTGCTC ATTTCCCGAG
ATAAGCGATA AAAAGACTTC CTTTTACCTT AAAACAAAAC TAAATTCAAA GGCTGTTGTT
ATTAGAGTCA CAACAGAGTC TAAAAAAAGT ATTTTTTATG GAGATACTGT AAAAGTTTCT
GGAAAACTTA AAATTCCAAA AGGAAAGACA AGTAAATTTG GTTTTGATTA CAGAGAATAT
TTGAAAGGCA AAGGTGCTAT TTATACACTT TACTCAAAAG ACATAGAGGT TATCTATCAA
GGAAAAAATG TTCTCAATCT TCTCAATAGA TTTTCTACAC AGTTAAATAA CCTCATAGAT
AGCTCTTTTG AAAATGATAT ATCTTCGCTT TTAAAAGGTT TGATTCTTGG CAACAAATCT
ACAATTCCAG ATGATGTGTA CAAAGACTTT CAGCGAAGCG GACTTGCTCA CCTTCTTGCA
GTCTCTGGTG GAAATGTAGG GGTGCTTTGC GCTTTTGTTG AGATTTTGTT CAGAAGAATA
TTAAAGATAT ATGGTAAAGG AGTAAACTTT TTAATAATAG GTGTCATAGT TATTTTTGCT
ATTGTCACAG GGATGTCAGC ATCAGTTGTT AGGGCTTCGA TTATGGCGAT AATCTTCTAT
GCTGGAAGGA TTATTTACAG AAATCCTGAT ACGCTCAATA GCCTATCTGT ATCAAGCGTT
TTGATGCTGC TTGTAAATCC GCTTTTTCTT TTTGACATTG GGTTCCAGCT GTCTTTTTTG
AGTGTTCTTT CAATAATTCT GTTTTGTAAA GGGATATATG AATATTTTGC GAAGTTAAAG
ATACCAAGAG GTATATCTTC ACTTATTGCA GTTTCAATTT CTGCTCAGAT TTTAATATTG
CCGTTGATAG CTTATTATTT TTCTGAGATC TCAGTTATTT CATTCTTGAC AAACATAGTT
GCTGTACCAG TTGCAGGTGC TGTTGTACCG GCTGGGCTGC TGTATTGTCT ATTATTGGTT
TTCAATATAG ATATATTACC ATTTAAATGG TTTTTAGAAG TCTGTGTGAA CGTGCTAATG
TACCTTTCAA GATTATCTTA TGTAGGATTT TCGCATGTAA AGGTCATTTT ATGGGATGAA
AAGCTAATAT TTTGTTACTA TCTTGTTGTG GCATCTTTAA TTTTTAGAAA ATTTATAAAC
AGGCAACTAA AATATGTGAT ATATTTGAGT ATTTGTGGAC TGCTTGTGGC ATTTATCTTA
CAGACACTTA TAAATTACAA CAGGCTCATC ATAAACGTGA TAGACGTAGG GCAGGGAGAC
AGTAGCTTTA TTACATACAA GGGATTTTCA ATGCTGATTG ACACAGGGCC TGAATATGAA
GATTTTAGCA GCTTGAAAAG AATTGTTCTT CCGTATATAC TCAAAAGAGG AGTAGCAAAA
CTTGATGTTT TAGTCTTGAC ACACAAGCAC AGCGACCATA TGGGGGACTT TGAGTATCTG
CTTTATGAGA TGAAAGTGGA CACAATTGTA ACATCAAAAG AGGTATATTT TGAAAATGCT
CAAAAGTTCA AAGGGCAAAA GGTTGTGTTA GTAGATAGCT TGAAAGTTTA TCGCTACAAG
GATTTAAAAG CCTATTTTAT CCCACCAGTA GAGGAAGATG AAAATAGTTC TGTTGTTGTG
AAGCTGACCC TTGGCAATTT TTCTATGCTA TTTACAGGTG ATGCCTCATA TGAGTCTGAA
AAGGAATACG TAAAGAAATA TAACTTGCAG ACAAAGATCT TAAAGGTGGG ACACCATGGA
AGCAGCACAG CAACATCTGA AGAGTTTTTG GAAAATGTAA AGCCAACATT TGCAGTAATT
TCTGTAGGGA AAGACAACAT CTTTGGGCAT CCTTCGAATG AGGTCTTACA AAGACTCAAA
GACAGAAACA TTAAGGTGTA TAGAACAGAT TTAAATGGAA CCATAGACAT TATAGTTGAC
AGAAATAAGA TGATGGTAAA TCCGTATATT GTGAGGTGA
 
Protein sequence
MTRKALFVAS FMMIGIVLGR NIKKIEVLVF CLLLILGALC ATYYFLPQYF KKEKFMFILC 
FLFLTLQLFR TYYIFNILEP QKNLDGKHVY IVGNICSFPE ISDKKTSFYL KTKLNSKAVV
IRVTTESKKS IFYGDTVKVS GKLKIPKGKT SKFGFDYREY LKGKGAIYTL YSKDIEVIYQ
GKNVLNLLNR FSTQLNNLID SSFENDISSL LKGLILGNKS TIPDDVYKDF QRSGLAHLLA
VSGGNVGVLC AFVEILFRRI LKIYGKGVNF LIIGVIVIFA IVTGMSASVV RASIMAIIFY
AGRIIYRNPD TLNSLSVSSV LMLLVNPLFL FDIGFQLSFL SVLSIILFCK GIYEYFAKLK
IPRGISSLIA VSISAQILIL PLIAYYFSEI SVISFLTNIV AVPVAGAVVP AGLLYCLLLV
FNIDILPFKW FLEVCVNVLM YLSRLSYVGF SHVKVILWDE KLIFCYYLVV ASLIFRKFIN
RQLKYVIYLS ICGLLVAFIL QTLINYNRLI INVIDVGQGD SSFITYKGFS MLIDTGPEYE
DFSSLKRIVL PYILKRGVAK LDVLVLTHKH SDHMGDFEYL LYEMKVDTIV TSKEVYFENA
QKFKGQKVVL VDSLKVYRYK DLKAYFIPPV EEDENSSVVV KLTLGNFSML FTGDASYESE
KEYVKKYNLQ TKILKVGHHG SSTATSEEFL ENVKPTFAVI SVGKDNIFGH PSNEVLQRLK
DRNIKVYRTD LNGTIDIIVD RNKMMVNPYI VR