Gene Athe_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0101 
Symbol 
ID7408463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp122659 
End bp123747 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content33% 
IMG OID643714509 
ProductMonosaccharide-transporting ATPase 
Protein accessionYP_002572032 
Protein GI222528150 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID[TIGR02634] D-xylose ABC transporter, substrate-binding protein 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACATA TTTATAAAAG TAAAAAAACA TCAAAAAGAA GTAGAGCTAA AATTTTTTGG 
GTTATTTTTT TGTTGATTTT TGTTGTAGCA GGCATTGTGA TTTTAATTGC ACATATTCCT
GATATTTCTA AAAATGAACA GAAGGTTTTT AAACCATCAA AAGTGAGAAT TGGCTTTGCA
ATGGGTACAC TAAAGGAGGA AAGATGGTTC AAAGACAGGG ACATCTTGAT TGCAAAAGCA
CATGAAAAAG GATATGAGGT TGAATGGGTC AACGCAAATG AGAACGATGT TGAACAAATA
AATCAGGTGA AATATCTTTT GAGCAAAAAT ATAAATATTT TGATTATTGT TCCTAACAAC
TATGAAAAAT GTAGCAGTGC AGTAAATCTT GCTAAAAAGA AAGGAATAAA AGTTATAAGT
TATGACAGAC TTGTGAAAAA CAGTGACATA GATGTATATG TCTCTTTTAA CAATTACAAA
GTAGGAGAGC TTATGGCAAA ATGGCTTTTG AAAAAAGTTC CCTATGGAAA CTACGTCTTT
CTACTTGGTG ACCCAGGGGA TTATAACGTT CAGATGATAA AGGAAGGCTA TCACAAAGTA
TTAGATTCAC TTATTCAGAA AAAACAAATC AATAGTCTTT TAGAAAAATA CTGTTATAAC
TGGAGAAAGG AATATGCATA TAATTATGTC AATAACCTTT TAGAAGAGGG AAAAAGAATT
GATGCAGTTT TAGCTTCTAA CGATTCACTT GCTGAGGGTG CGATTATGGC ACTTTCGGAA
AAGCGGCTTG CTGGCAGTGT ACCTGTTACA GGCCAGGATG CAGACATCTC AGCATGTCAA
AGGATTGTCA AGGGCACTCA GCTTATGACT GTCTATAAGC CCATCGATAA GCTTGTTGAC
CTCACGCTTG ATATAGTTGA CAGGCTAATA AAAGGCAAAC TTCTAAAGCC TAATTACACT
ATTAATAATG GTTACAAAAA CGTTCCAACT TTTTTTATTG ACCCAATAGG TGTTGACAAA
ACCAATATTA ATGATACTGT TATAAAAGAC AATTTTCATA CATGGGATGA GGTATATATA
ACAAAGTAG
 
Protein sequence
MKHIYKSKKT SKRSRAKIFW VIFLLIFVVA GIVILIAHIP DISKNEQKVF KPSKVRIGFA 
MGTLKEERWF KDRDILIAKA HEKGYEVEWV NANENDVEQI NQVKYLLSKN INILIIVPNN
YEKCSSAVNL AKKKGIKVIS YDRLVKNSDI DVYVSFNNYK VGELMAKWLL KKVPYGNYVF
LLGDPGDYNV QMIKEGYHKV LDSLIQKKQI NSLLEKYCYN WRKEYAYNYV NNLLEEGKRI
DAVLASNDSL AEGAIMALSE KRLAGSVPVT GQDADISACQ RIVKGTQLMT VYKPIDKLVD
LTLDIVDRLI KGKLLKPNYT INNGYKNVPT FFIDPIGVDK TNINDTVIKD NFHTWDEVYI
TK