Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1927 |
Symbol | |
ID | 7407340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2034323 |
End bp | 2036362 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716299 |
Product | Beta-galactosidase |
Protein accession | YP_002573788 |
Protein GI | 222529906 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000283864 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAAAA TAAAGCTGAA AAAATTTTTG CATGGTGGAG ACTACAATCC AGACCAGTGG ACAGAGGATG TGTGGGAAAA GGACATTGAA TATATGAAAT ACTATAATGT AAATGCAGTT TCTATGCCAA TATTTTCATG GGCACAGCTT CAGCCAAATG AAGATGAATT TACATTTGAA TGGCTTGACA AAATAATTGA TAAGCTCTAT TCAAATGGTA TTCATGTTAT CTTGGCAACA CCTACGGCTT CTCAGCCAGC ATGGCTTTCT AAAAAGTATC CTGATGTGCT TCCTGTTGAT ATTCATGGAA GAAAGAGAAA ACATGGAGCA AGGCAGAATT ACTGCCCAAA CAGTCCAAAC TTCAAAAATG CAGCAAGAAG AATTGTTGAG GAGATGGTAA AAAGGTATAA AGACCATCCT GCAGTTATAA TGTGGCATAT CAGTAACGAA TATGGTCCTT ACTGCTACTG TGAAAACTGT GCAAAAGCCT TTAGAGAGTG GCTAAAAGAA AGATATAAAA CATTGGATGA GCTCAACAAA AGATGGAACA CAGCTTTCTG GGGACATACA TTTTATGATT GGGATGAGAT AGAAGTTCCC TCATATCTGA ACGAAGAGTT TGAATATATG CCTGGAAGGC AGAAAAGCTC ATTCCAGGGA CCTTCGCTTG ATTACAAGAG GTTTATGTCA GACAGCCTTC TGAATCTTTA TAAAATGGAA GTTGAGATTA TCAAAAAATA CATGCCAGAT ATCCCTGTTA CAACAAACCT GATGGGCCCA TTTAAGCCTC TTGACTATCA CAAATGGGCA AAACATATGG ATATTGTATC ATGGGACAAT TACCCATCGA TAAAAGATTC TCCAAGTTCT ATTGCTTTCA AGCATGACCT CATGAGAGGG CTCAAAAGAG ACCAATCGTG GATTTTGATG GAACAAACAC CGAGCCAGAC AAATTGGCAT TGGTACAACT CTGCAAAAAG ACCTGGTATG ATAAGGCTTT TGAGCTATCA TGCAATTGCA CATGGAGCTG ACTCTGTGCT GTATTTCCAG TGGAGACAGT CAGTTGCTTC GTGCGAAAAG TTTCACTCTG CGATGGTTCC GCATGTTGGA CACCTTGAGA CGAGGGTGAG CAAAGAGCTT AAAAAGATTG GCGATGAACT TTTGCGCTTA GATGAGATTT TGGAGTCAAC AACTAAGAGC GAGGTTGCAC TATTATTTGA CTGGGAAAAC TGGTGGGCGC TTGAAGAGAG TATGGGATTT AGAAATGATA TATCTTACCT TGAACATATA GATGCTTACT ATAAAGCGCT GTATAAGCTA AAAACAAATG TGGATGTTGT TGACCCGAAA GAAGATTTAA CAAGGTACAA ACTTGTTGTT GCACCACTTT TGTATCTTCT TGATAAAGAG ACTGCAAAGA ATATAGAAAA TTATGTAAAA AACGGTGGAA TATTTATTAC AACATATTTA TCAGGACTTG TTGATGAAAA TGACAGAGTA ATTCTTGGCG GCTATCCGGG TTGGTTTAGG AAACTCTGTG GTATCTGGGT TGAGGAGATT GATGCGCTTT TCCCTGATAT GAAAAATGCA ATTATACTTG AAAAACCTAT TGGCATGCTT GATGGCAAAT ACGAATGTGA TTTTATCTGT GACGTTATTC ACCTTGAGGG TGCAAGGGCG CTTGCTTACT ATGAGCAGGA TTATTACCGC GGAATGCCAG CTGTTGTTGA AAATAATTAT GGAAATGGAA AGGCGATTTA TATTGGAACA AGACCAGAAC AAAGGTTTAT AGAAGGTCTT GTTAAGTTCT ACGCTGAAAA GGCTGGTGTA CAACCAATAT TACTTGTGCC GGAAGGTGTT GAAGTAACAA AAAGAGAAAA GAATGGGAAT GAATATGTGT TTCTTTTGAA TTTCAATGGT TATGATGTAA ATATTGAGCT TAAAGATGAG TATTATGAGC TTATAACACA GAAGATTTTG GGCGGAAAAG CTACTCTTGC CCCGAAGGAG GTTATGATAC TGAGAAGATT AAAAGATTAA
|
Protein sequence | MGKIKLKKFL HGGDYNPDQW TEDVWEKDIE YMKYYNVNAV SMPIFSWAQL QPNEDEFTFE WLDKIIDKLY SNGIHVILAT PTASQPAWLS KKYPDVLPVD IHGRKRKHGA RQNYCPNSPN FKNAARRIVE EMVKRYKDHP AVIMWHISNE YGPYCYCENC AKAFREWLKE RYKTLDELNK RWNTAFWGHT FYDWDEIEVP SYLNEEFEYM PGRQKSSFQG PSLDYKRFMS DSLLNLYKME VEIIKKYMPD IPVTTNLMGP FKPLDYHKWA KHMDIVSWDN YPSIKDSPSS IAFKHDLMRG LKRDQSWILM EQTPSQTNWH WYNSAKRPGM IRLLSYHAIA HGADSVLYFQ WRQSVASCEK FHSAMVPHVG HLETRVSKEL KKIGDELLRL DEILESTTKS EVALLFDWEN WWALEESMGF RNDISYLEHI DAYYKALYKL KTNVDVVDPK EDLTRYKLVV APLLYLLDKE TAKNIENYVK NGGIFITTYL SGLVDENDRV ILGGYPGWFR KLCGIWVEEI DALFPDMKNA IILEKPIGML DGKYECDFIC DVIHLEGARA LAYYEQDYYR GMPAVVENNY GNGKAIYIGT RPEQRFIEGL VKFYAEKAGV QPILLVPEGV EVTKREKNGN EYVFLLNFNG YDVNIELKDE YYELITQKIL GGKATLAPKE VMILRRLKD
|
| |