Gene Athe_1913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1913 
Symbol 
ID7407326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2017642 
End bp2019249 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content34% 
IMG OID643716285 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002573774 
Protein GI222529892 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.988615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC GTATTGTGGC TACTTTTATT TTAGTTGCGT TCCTTGTGAC AGGGTTATTT 
TTAGGTACAA ATTACAAAAA TGCAACAGCT GCTTCATCTA AACAGGTGCT TACTTATATT
AATGGTGCTG AACCAAGATA TCTTGACCCA GCTTTAAATA CTGCGCTTGA TGCAGCAAAT
ATTATTATTA ATGTTTTTGA AGGCTTGACA AGGGTTGATG TAAAAGGAAG AACTGTCCCA
GGAATGGCAG AAAAATGGAC AGTATCAAAG GATGGACTTA CTTATACTTT CTATATAAGA
AAGAATGCAA AGTGGTCAGA CGGTAAACCT GTTACAGCAT ACGATTTTGA GTATGCTTGG
AAAAGAGCAT TAGATCCGAA AACAGGTTCA GAATATGCTT ATCAGCTTTT CTACATCAAA
AATGGTCAAA AATTCTATGA AGGTAAAGCA AAAGCATCTG ATGTTGGTGT CAAAGCTTTA
AATGCTACAA CTTTACGGGT TACATTGGAA GCACCAACAC CATACTTTAT TGATCTTACA
AACTTCCCAA CATACTTCCC AGTAAGAAAA GATATTGTTG AGAAGTATGG TGATAAATGG
CAAACAGATC CAAAGACATA TATTGGAAAT GGTCCATTTA AAATGACAAA ATGGGTTCAT
AATTCTTATA TTGAACTTAC TAAGAACACC AATTACTGGG ATGCAAAGTC AATTACTTTA
CAAAAGATGG TGCTTAAATT ATCATCTGAT AACAATGCTA ATTTGATGGC TTTTACTGCA
GGGCAAGTTG ATGGTGCCGA AGGTATACCA ACCGAAGAAA TTCCAAGACT TAAAAAAGAA
GGAAAACTTA AAATAGCACC TTTATTAGGA ACATACTACT ATGATGTAAA TTGCAAGAAA
GCACCTTTTA ATGATAAGAG AGTAAGAGAG GCTTTATCAC TTGCTATTGA TAGGACACGT
ATTGTAGCTC TTCTAAAAGG TGAACAAAAG CCAGCAACAG GTTTTGTTCC ATATGGTGTT
AAAGGAATTT CTAAAGATTT CAGAAGTGAA GCAGGTAATT ACTTACCAGT GAATGCTGAT
TTAGCAAAAG CAAAGAAACT GTTAGCTGAA GCAGGGTATC CAAACGGAAA GAATTTCCCA
GATATTGAGA TTATTTATAA TACTGATGAA GGACATAAAA AAGTTGCCGA AGCTATTCAA
AATATGTGGA AACAACTTGG AATAAATGTT AAACTTTCTA ATATGGAATG GAAAGTATTG
CTTGAAAGAA GACAGAAAAA AGACTATATA GTAGCAAGAG ATGGATGGGT TGGCGATTAT
AACGATCCGA TGACTTTCTT AGATTTGTTT ACCTCATATA GTGGCAATAA TAACACAAAT
TGGAGCAATA AGCAATATGA TTCTCTAATA GATAAGGCTA AGAAGACCAT TGATGCTAAA
CAAAGAATGC AGTATATGAT ACAAGCAGAA AAAATATTGA TGCAAGACCA TGCAATAATT
CCAATCTATT TCTATACAAA AGTTTATCTT CTGAGAGACT ATGTAAAGAA TTACTATATT
TCTCCACTTG GATTTAACTA CTTCATGTAT GCCAAGATTG TAAAGTAA
 
Protein sequence
MKKRIVATFI LVAFLVTGLF LGTNYKNATA ASSKQVLTYI NGAEPRYLDP ALNTALDAAN 
IIINVFEGLT RVDVKGRTVP GMAEKWTVSK DGLTYTFYIR KNAKWSDGKP VTAYDFEYAW
KRALDPKTGS EYAYQLFYIK NGQKFYEGKA KASDVGVKAL NATTLRVTLE APTPYFIDLT
NFPTYFPVRK DIVEKYGDKW QTDPKTYIGN GPFKMTKWVH NSYIELTKNT NYWDAKSITL
QKMVLKLSSD NNANLMAFTA GQVDGAEGIP TEEIPRLKKE GKLKIAPLLG TYYYDVNCKK
APFNDKRVRE ALSLAIDRTR IVALLKGEQK PATGFVPYGV KGISKDFRSE AGNYLPVNAD
LAKAKKLLAE AGYPNGKNFP DIEIIYNTDE GHKKVAEAIQ NMWKQLGINV KLSNMEWKVL
LERRQKKDYI VARDGWVGDY NDPMTFLDLF TSYSGNNNTN WSNKQYDSLI DKAKKTIDAK
QRMQYMIQAE KILMQDHAII PIYFYTKVYL LRDYVKNYYI SPLGFNYFMY AKIVK