Gene Athe_1330 details


Gene Information

Locus tag: Athe_1330
Symbol:
ID: 7408911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1415196
End bp: 1416251
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 30%
IMG OID: 643715695
Product: protein of unknown function UPF0118
Protein accession: YP_002573203
Protein GI: 222529321
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
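
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the listed values but are not stated in the record itself.

```python
# Values copied from the Gene Information fields.
start_bp = 1415196
end_bp = 1416251

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```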


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCACATAA TAAAATTGGT CAAAAGATAT TTTACAGATA TATTGTTCAT AGCTCTAATT 
GCAATTGTTA TCTATTTTTT TACTAATATG AAGGCATTTT GGCCGATTCT GATTCCATTT
TTGATTGCAC TATTTTTGTC ATATCTCTTA AAACCTTGCG TAGATTTTTT AGAAACAAAG
ATTCGGTCAA GAGATATCTC AATCCTGATT TCGTTTGCAA TAATCTTTGG TATCACCATT
ATGGTATTTG TATATTTTAT TCCTTTATTT GTTAGCGAAA CTAAGCAGCT TATCCAAAAC
ATTCCTGATT ATATAATACT AATTCAAAAA TGGTTTTTTG AGATTGATTC TAAACTTTTG
AATAAACTAA ACATTGATAT TAAAGAAATA CTAAACGCTA ATTCAATCAA TATCGAAGGA
ATTTCCAAAC AAACATTATC AATATTTTTA AACATTGTAA AGAGTATTTC CTCTAACATT
TTGTATTATC TTCTTATTCC TATTATATCT TTTTATATCC TGAGGGATTG GAAAAGGTTA
GTCATGTGGA TAAAATGGTT ATTACCCGAG AAATACAGAA AAGAAGGACT TTATATCTTT
GTTGATATAA ATAGGGTTCT TCATCAGTAT ATTCGAGGGC AGCTTCTTGA TGCCTTTATA
GTTGGACTGC TCAGCTTTGT AGGATTTTCT CTGCTTTCTG TAAGATATGC AGCTCTTTTG
GGTGTAATAA CTGGTATTGG CAATTTGATT CCCTATTTTG GACCAATATT TAGCAGTATT
CCAGCAGTGA TAATAGCACT TTCTGACTCT TACATAAAGG CTATATTGGT TGTGATTTTT
TTAGTCCTAC TTCAGCAAGT TGACAGTTTT ATCATATCCC CACGAGTTAT TGGTTCAAAA
GTCGGGCTTC ATCCTCTTAC CATAATTATA GTTATAATCT TAGCAAACAA AATATTTGGG
TTTATTGCAA TGTTCTTTGC TATTCCTATT GCTGCAGTAA TAAAAATTAT ATTTATTAAT
ATCATGAAAA GGATAAAATC TGAGAAGATT GAGTGA
 
Protein sequence
MHIIKLVKRY FTDILFIALI AIVIYFFTNM KAFWPILIPF LIALFLSYLL KPCVDFLETK 
IRSRDISILI SFAIIFGITI MVFVYFIPLF VSETKQLIQN IPDYIILIQK WFFEIDSKLL
NKLNIDIKEI LNANSINIEG ISKQTLSIFL NIVKSISSNI LYYLLIPIIS FYILRDWKRL
VMWIKWLLPE KYRKEGLYIF VDINRVLHQY IRGQLLDAFI VGLLSFVGFS LLSVRYAALL
GVITGIGNLI PYFGPIFSSI PAVIIALSDS YIKAILVVIF LVLLQQVDSF IISPRVIGSK
VGLHPLTIII VIILANKIFG FIAMFFAIPI AAVIKIIFIN IMKRIKSEKI E
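
The gene sequence above is printed in 10-base blocks and begins with GTG, which is one of the alternative start codons in translation table 11 and is read as methionine at the initiation position. A minimal sketch of how such a block-formatted sequence might be cleaned and sanity-checked follows. The helper name `summarize_cds` is hypothetical, and the input shown is a toy string rather than the full record; paste the full gene sequence in to check it against the listed length and GC content.

```python
# Start and stop codons for NCBI translation table 11.
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOPS = {"TAA", "TAG", "TGA"}


def summarize_cds(block_text):
    """Hypothetical helper: summarize a CDS pasted in whitespace-separated blocks."""
    # Remove spaces and line breaks, then normalize case.
    seq = "".join(block_text.split()).upper()
    gc = sum(1 for base in seq if base in "GC")
    return {
        "length": len(seq),
        "gc_fraction": gc / len(seq) if seq else 0.0,
        "starts_with_start_codon": seq[:3] in TABLE_11_STARTS,
        "ends_with_stop_codon": seq[-3:] in STOPS,
        "whole_codons": len(seq) % 3 == 0,
    }


# Toy input only, not the real gene.
toy = summarize_cds("GTGCAC ATATGA")
print(toy)
```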