Gene Athe_1927 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1927 
Symbol 
ID7407340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2034323 
End bp2036362 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content38% 
IMG OID643716299 
ProductBeta-galactosidase 
Protein accessionYP_002573788 
Protein GI222529906 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000283864 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAAAA TAAAGCTGAA AAAATTTTTG CATGGTGGAG ACTACAATCC AGACCAGTGG 
ACAGAGGATG TGTGGGAAAA GGACATTGAA TATATGAAAT ACTATAATGT AAATGCAGTT
TCTATGCCAA TATTTTCATG GGCACAGCTT CAGCCAAATG AAGATGAATT TACATTTGAA
TGGCTTGACA AAATAATTGA TAAGCTCTAT TCAAATGGTA TTCATGTTAT CTTGGCAACA
CCTACGGCTT CTCAGCCAGC ATGGCTTTCT AAAAAGTATC CTGATGTGCT TCCTGTTGAT
ATTCATGGAA GAAAGAGAAA ACATGGAGCA AGGCAGAATT ACTGCCCAAA CAGTCCAAAC
TTCAAAAATG CAGCAAGAAG AATTGTTGAG GAGATGGTAA AAAGGTATAA AGACCATCCT
GCAGTTATAA TGTGGCATAT CAGTAACGAA TATGGTCCTT ACTGCTACTG TGAAAACTGT
GCAAAAGCCT TTAGAGAGTG GCTAAAAGAA AGATATAAAA CATTGGATGA GCTCAACAAA
AGATGGAACA CAGCTTTCTG GGGACATACA TTTTATGATT GGGATGAGAT AGAAGTTCCC
TCATATCTGA ACGAAGAGTT TGAATATATG CCTGGAAGGC AGAAAAGCTC ATTCCAGGGA
CCTTCGCTTG ATTACAAGAG GTTTATGTCA GACAGCCTTC TGAATCTTTA TAAAATGGAA
GTTGAGATTA TCAAAAAATA CATGCCAGAT ATCCCTGTTA CAACAAACCT GATGGGCCCA
TTTAAGCCTC TTGACTATCA CAAATGGGCA AAACATATGG ATATTGTATC ATGGGACAAT
TACCCATCGA TAAAAGATTC TCCAAGTTCT ATTGCTTTCA AGCATGACCT CATGAGAGGG
CTCAAAAGAG ACCAATCGTG GATTTTGATG GAACAAACAC CGAGCCAGAC AAATTGGCAT
TGGTACAACT CTGCAAAAAG ACCTGGTATG ATAAGGCTTT TGAGCTATCA TGCAATTGCA
CATGGAGCTG ACTCTGTGCT GTATTTCCAG TGGAGACAGT CAGTTGCTTC GTGCGAAAAG
TTTCACTCTG CGATGGTTCC GCATGTTGGA CACCTTGAGA CGAGGGTGAG CAAAGAGCTT
AAAAAGATTG GCGATGAACT TTTGCGCTTA GATGAGATTT TGGAGTCAAC AACTAAGAGC
GAGGTTGCAC TATTATTTGA CTGGGAAAAC TGGTGGGCGC TTGAAGAGAG TATGGGATTT
AGAAATGATA TATCTTACCT TGAACATATA GATGCTTACT ATAAAGCGCT GTATAAGCTA
AAAACAAATG TGGATGTTGT TGACCCGAAA GAAGATTTAA CAAGGTACAA ACTTGTTGTT
GCACCACTTT TGTATCTTCT TGATAAAGAG ACTGCAAAGA ATATAGAAAA TTATGTAAAA
AACGGTGGAA TATTTATTAC AACATATTTA TCAGGACTTG TTGATGAAAA TGACAGAGTA
ATTCTTGGCG GCTATCCGGG TTGGTTTAGG AAACTCTGTG GTATCTGGGT TGAGGAGATT
GATGCGCTTT TCCCTGATAT GAAAAATGCA ATTATACTTG AAAAACCTAT TGGCATGCTT
GATGGCAAAT ACGAATGTGA TTTTATCTGT GACGTTATTC ACCTTGAGGG TGCAAGGGCG
CTTGCTTACT ATGAGCAGGA TTATTACCGC GGAATGCCAG CTGTTGTTGA AAATAATTAT
GGAAATGGAA AGGCGATTTA TATTGGAACA AGACCAGAAC AAAGGTTTAT AGAAGGTCTT
GTTAAGTTCT ACGCTGAAAA GGCTGGTGTA CAACCAATAT TACTTGTGCC GGAAGGTGTT
GAAGTAACAA AAAGAGAAAA GAATGGGAAT GAATATGTGT TTCTTTTGAA TTTCAATGGT
TATGATGTAA ATATTGAGCT TAAAGATGAG TATTATGAGC TTATAACACA GAAGATTTTG
GGCGGAAAAG CTACTCTTGC CCCGAAGGAG GTTATGATAC TGAGAAGATT AAAAGATTAA
 
Protein sequence
MGKIKLKKFL HGGDYNPDQW TEDVWEKDIE YMKYYNVNAV SMPIFSWAQL QPNEDEFTFE 
WLDKIIDKLY SNGIHVILAT PTASQPAWLS KKYPDVLPVD IHGRKRKHGA RQNYCPNSPN
FKNAARRIVE EMVKRYKDHP AVIMWHISNE YGPYCYCENC AKAFREWLKE RYKTLDELNK
RWNTAFWGHT FYDWDEIEVP SYLNEEFEYM PGRQKSSFQG PSLDYKRFMS DSLLNLYKME
VEIIKKYMPD IPVTTNLMGP FKPLDYHKWA KHMDIVSWDN YPSIKDSPSS IAFKHDLMRG
LKRDQSWILM EQTPSQTNWH WYNSAKRPGM IRLLSYHAIA HGADSVLYFQ WRQSVASCEK
FHSAMVPHVG HLETRVSKEL KKIGDELLRL DEILESTTKS EVALLFDWEN WWALEESMGF
RNDISYLEHI DAYYKALYKL KTNVDVVDPK EDLTRYKLVV APLLYLLDKE TAKNIENYVK
NGGIFITTYL SGLVDENDRV ILGGYPGWFR KLCGIWVEEI DALFPDMKNA IILEKPIGML
DGKYECDFIC DVIHLEGARA LAYYEQDYYR GMPAVVENNY GNGKAIYIGT RPEQRFIEGL
VKFYAEKAGV QPILLVPEGV EVTKREKNGN EYVFLLNFNG YDVNIELKDE YYELITQKIL
GGKATLAPKE VMILRRLKD