Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0101 |
Symbol | |
ID | 7408463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 122659 |
End bp | 123747 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643714509 |
Product | Monosaccharide-transporting ATPase |
Protein accession | YP_002572032 |
Protein GI | 222528150 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4213] ABC-type xylose transport system, periplasmic component |
TIGRFAM ID | [TIGR02634] D-xylose ABC transporter, substrate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACATA TTTATAAAAG TAAAAAAACA TCAAAAAGAA GTAGAGCTAA AATTTTTTGG GTTATTTTTT TGTTGATTTT TGTTGTAGCA GGCATTGTGA TTTTAATTGC ACATATTCCT GATATTTCTA AAAATGAACA GAAGGTTTTT AAACCATCAA AAGTGAGAAT TGGCTTTGCA ATGGGTACAC TAAAGGAGGA AAGATGGTTC AAAGACAGGG ACATCTTGAT TGCAAAAGCA CATGAAAAAG GATATGAGGT TGAATGGGTC AACGCAAATG AGAACGATGT TGAACAAATA AATCAGGTGA AATATCTTTT GAGCAAAAAT ATAAATATTT TGATTATTGT TCCTAACAAC TATGAAAAAT GTAGCAGTGC AGTAAATCTT GCTAAAAAGA AAGGAATAAA AGTTATAAGT TATGACAGAC TTGTGAAAAA CAGTGACATA GATGTATATG TCTCTTTTAA CAATTACAAA GTAGGAGAGC TTATGGCAAA ATGGCTTTTG AAAAAAGTTC CCTATGGAAA CTACGTCTTT CTACTTGGTG ACCCAGGGGA TTATAACGTT CAGATGATAA AGGAAGGCTA TCACAAAGTA TTAGATTCAC TTATTCAGAA AAAACAAATC AATAGTCTTT TAGAAAAATA CTGTTATAAC TGGAGAAAGG AATATGCATA TAATTATGTC AATAACCTTT TAGAAGAGGG AAAAAGAATT GATGCAGTTT TAGCTTCTAA CGATTCACTT GCTGAGGGTG CGATTATGGC ACTTTCGGAA AAGCGGCTTG CTGGCAGTGT ACCTGTTACA GGCCAGGATG CAGACATCTC AGCATGTCAA AGGATTGTCA AGGGCACTCA GCTTATGACT GTCTATAAGC CCATCGATAA GCTTGTTGAC CTCACGCTTG ATATAGTTGA CAGGCTAATA AAAGGCAAAC TTCTAAAGCC TAATTACACT ATTAATAATG GTTACAAAAA CGTTCCAACT TTTTTTATTG ACCCAATAGG TGTTGACAAA ACCAATATTA ATGATACTGT TATAAAAGAC AATTTTCATA CATGGGATGA GGTATATATA ACAAAGTAG
|
Protein sequence | MKHIYKSKKT SKRSRAKIFW VIFLLIFVVA GIVILIAHIP DISKNEQKVF KPSKVRIGFA MGTLKEERWF KDRDILIAKA HEKGYEVEWV NANENDVEQI NQVKYLLSKN INILIIVPNN YEKCSSAVNL AKKKGIKVIS YDRLVKNSDI DVYVSFNNYK VGELMAKWLL KKVPYGNYVF LLGDPGDYNV QMIKEGYHKV LDSLIQKKQI NSLLEKYCYN WRKEYAYNYV NNLLEEGKRI DAVLASNDSL AEGAIMALSE KRLAGSVPVT GQDADISACQ RIVKGTQLMT VYKPIDKLVD LTLDIVDRLI KGKLLKPNYT INNGYKNVPT FFIDPIGVDK TNINDTVIKD NFHTWDEVYI TK
|
| |