Gene Athe_1783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1783 
Symbol 
ID7408570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1856777 
End bp1857799 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content39% 
IMG OID643716160 
Productbasic membrane lipoprotein 
Protein accessionYP_002573649 
Protein GI222529767 
COG category[R] General function prediction only 
COG ID[COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000546554 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGAA GCTTGTATGC GGTTTTAAGT CTTGTTGTGA TTGCTGCTCT TTTATTAAGC 
TTAAATGTAT CTTGGAACAT GGCAAGTGGG TCTACATCCC AAAAAACTTT TAAGGTAGGG
CTTGTCACTG ACGTTGGTGG TGTCAATGAC AGAAGTTTCA ACCAGTCTGC ATATGAAGGA
CTGAAAAGAG CTGAAAAAGA ACTAAAAATC AAAACAACTC TCATTCAGTC AAAGCAGATG
ACAGACTATG TTCCAAACTT ACAGAAACTT GCAAAAGCTA ATTACGATTT AATTATTGCA
GTTGGTTTTT TGATGCACGA CTCGGTTGTG ACAGTTGCAA AGCAGTTCCC AAAGGCAAAA
TTTTTAATTA TCGACTCTGA GATTTCTGAC CTTCCTAATG TTGCCTCAGC TATGTTTAGA
GAAGAACAAG CAGGCTACTT AGCAGGAGTT GCTGCAGCTT TGCTTGAGAA AGCTAAGTTT
GGTAAAACAA CTGGAAAGAA TATATTTGGA GTTGTAGGTG GCATGAAGAT TCCACCTGTT
GACAGATACA TTGCAGGTTT TAAAGCTGGC GTCTTGAGCG AGATTCCAAA AGCAAAGGTT
ATAATCAAGT ACACTGGCAA GTTTGATGAC CCGGCATCAG GAAAACAGGT AGCTCTTTCA
GAAATTGCAC AGGGTGCTGA CTTTGTGTTC CAAGTTGCAG GGCAGACAGG TCTTGGTGTT
ATCCAGGCTG CAAAAGAAAA GGGAGTTTAT GCAATCGGTG TTGACTCAGA CCAAAGCTAT
GTTGCACCAG CCACTGTTGT TACCTCTGCA ATGAAGAGAG TTGACGTTGC AACATACAGT
GTTATAAAAG ATACATTAAA TGGGAAATTC AAAAGTGGTA TCATTTACTT TGATTTGAAA
AACAATGGCG TTGGACTTGC TCCGTTTATG AAAGGTGTTC CAAATAGTGT GAGTGCAAAA
ATAAACAAGG TAATTGCTGA TATAAAAGCT GGTAAGATAA AGATACCAAC AGAAGTAAAG
TAG
 
Protein sequence
MKRSLYAVLS LVVIAALLLS LNVSWNMASG STSQKTFKVG LVTDVGGVND RSFNQSAYEG 
LKRAEKELKI KTTLIQSKQM TDYVPNLQKL AKANYDLIIA VGFLMHDSVV TVAKQFPKAK
FLIIDSEISD LPNVASAMFR EEQAGYLAGV AAALLEKAKF GKTTGKNIFG VVGGMKIPPV
DRYIAGFKAG VLSEIPKAKV IIKYTGKFDD PASGKQVALS EIAQGADFVF QVAGQTGLGV
IQAAKEKGVY AIGVDSDQSY VAPATVVTSA MKRVDVATYS VIKDTLNGKF KSGIIYFDLK
NNGVGLAPFM KGVPNSVSAK INKVIADIKA GKIKIPTEVK