Gene Athe_2164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2164 
Symbol 
ID7408357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2293272 
End bp2294738 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content40% 
IMG OID643716529 
Productprotein of unknown function DUF1078 domain protein 
Protein accessionYP_002574012 
Protein GI222530130 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR01396] flagellar basal-body rod protein FlgB
[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAGGT CAATGTTTTC ATCTATCTCT GCTCTCCGCG CTCACCAGAC AAGAATGGAC 
GTTATTGGTG ATAACATCGC CAATGTAAAT ACAGTGGGGT TTAAGTCAAG CAGAGTGACA
TTTGCCTCTG TATTTGCTTC TGTTTTAAAG TCAGCGTCAG CACCAGATAC AGCGTCAGGT
CGAGGCGGGT CAAACCCTAT GCAGATTGGT CTTGGTGTGT CGGTTGCCTC TGTTGATATG
AACATGACAA GAGGAAGCCT TCAGAGAACA GATAATCCAA CAGACCTTGC AATTGAAGGT
GATGGATTTT TTGTTGTGGG AGGGGACGGA AAAGCTCCGC GATTCACCAG AGCGGGGAAT
TTCAGTTTAG ACAAGATGGG GAATTTAGTG ACCGCAACAG GATTAAATGT TCTGGGATGG
ATGTATGATC CTGTAAACAA TCAAATTGAT ACAACAAAAT CGCCTTCAAA GATCAACATA
CTTGCATTTC CAACTTTACC TCCAAAGGCA ACAGATAAGA TTAGTTTTGA CGGGAACCTA
AGCGCGGATA CAAAAATATA TTCAGGTCAA ATAACAAAAT TTGAAGATTT ATTGAACGTG
CCTGCCGATA GCAAATATTC AACAAGTTTT AAGATTTTCG ATTCACAGGG CAAAGAACAC
ACGTTACAGC TCACTTTTAT AAAAACAGGC GATAACACAT GGGAATGGTT TGTGGATGCT
CCGAGGGTAA AGAAGAATAT AGGTACTGCC CAAAATCCAC AAGAAGCATA TGTATATGTT
GATGATATGA TAGAGGCAAA CAATGACTAT GACAACTTTA TTGCAAGAGG AACAATAACA
TTTGGACAAG CAGGAAAGGT GCTTGATGAT GAAAATACAC CTGATGTAGA AGGAATTGCT
ATAACGGGTG GAAGGTTTAT TAATACACAA AACGGTACAT TTACAATTAA TTTCAAGAAC
AATGTTGTGA ATCCTGTTAC ATTAAAAGTT AATAGTTCTC AGTTTGATGT GAATGATGCC
ACAAACATTG CCTTTTTCTT GAAGAATATA ACTCAATTTG GTAATATGGA AAGTTCAATA
AGAGTTGCGC AGATGACAGG GTACAGTGCA GGAAGTCTTC AAGGATTTAA CGTTGATGCA
TCAGGTAAGA TAACAGGTGT ATATTCAAAT GGTTTGAACC AGCTAATTGG TCAGATTGCG
ATTGCAACAT TTGCAAACCC TGCAGGACTT CAGCGAATAG GCGATAATCT TTATATAAAC
ACAGTAAACT CAGGTGACCC TGAGATTGGA ACACCTGGGT CTGGCTCAAG AGGTACAATA
TCTCAGGGAA CGCTTGAGAT GTCAAATGTG GACTTAGCAA AAGAATTTAC AGACATGATA
GTAACTCAGA GAGGGTATCA GGCAAACGCA AGGGTGATAA CTGCATCAGA TGAACTTTTG
CAGGATTTGG TCAATATTAA AAGGTAA
 
Protein sequence
MMRSMFSSIS ALRAHQTRMD VIGDNIANVN TVGFKSSRVT FASVFASVLK SASAPDTASG 
RGGSNPMQIG LGVSVASVDM NMTRGSLQRT DNPTDLAIEG DGFFVVGGDG KAPRFTRAGN
FSLDKMGNLV TATGLNVLGW MYDPVNNQID TTKSPSKINI LAFPTLPPKA TDKISFDGNL
SADTKIYSGQ ITKFEDLLNV PADSKYSTSF KIFDSQGKEH TLQLTFIKTG DNTWEWFVDA
PRVKKNIGTA QNPQEAYVYV DDMIEANNDY DNFIARGTIT FGQAGKVLDD ENTPDVEGIA
ITGGRFINTQ NGTFTINFKN NVVNPVTLKV NSSQFDVNDA TNIAFFLKNI TQFGNMESSI
RVAQMTGYSA GSLQGFNVDA SGKITGVYSN GLNQLIGQIA IATFANPAGL QRIGDNLYIN
TVNSGDPEIG TPGSGSRGTI SQGTLEMSNV DLAKEFTDMI VTQRGYQANA RVITASDELL
QDLVNIKR