Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2062 |
Symbol | |
ID | 7408275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2179263 |
End bp | 2181101 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643716429 |
Product | glycosyl transferase, WecB/TagA/CpsF family |
Protein accession | YP_002573912 |
Protein GI | 222530030 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1922] Teichoic acid biosynthesis proteins |
TIGRFAM ID | [TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family [TIGR03609] polysaccharide pyruvyl transferase CsaB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000046039 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATA TTGTAATATC AGGATATTAT GGACAATTGA ATACTGGTGA TGAAGCCATA CTAAGGGTTT TGATAGATAG GCTGAGAGAA TATGAAAGAC AAGAAAATGA AGATTTAAAT GTTGTTGTTC TTTCATCAAG ACCACAACTG ACAAGCAAGA TTTACAGTGT AGACTCTGTC AATAGAAAAA AGATTTGGGA AGTTGTAAAA ACCATAAAAA GATGTGATAT ATTTATTTCC GGTGGAGGAA GTCTTTTTCA AAATGAGACA AGCAACAGAA GTCTTTATTA TTACCTCTTT CAAATTTTTC TTGCCAAGCT ATTTGGCAAG AAGGTGTTTA TTTTTTCTCA AGGGATAGGA CCAATAAAAA AGTGGTATAA TATTTTGATT TTCAAGCATA TAATCAAACT TGCTGATTAT ATTACAGTGA GAGACTACGA TTCTTTTGAC CTTTTACACA GACTGAAGCT TAAAAACAAA ATAGACTTGT CAGCAGACCC TGCATTTTTA CTGAATCCTT GCTGTGAAAA AAGAGTAGAA AAATTATTGC AAACTTATAA CATAGATTTG AGCAAAAAGA CGATAGGAAT AGTAGTAAGG AAATGGAAAA AGGAAAAGGA TATGACCGAC AAAATTGCAC AGATTGCGGA CATTCTTATA GAAAATGAAG GATATAATGT AGTTTTTATT CCTTTTCAGG GTAAGTGGGA CATAATAAAG ATAAATGAGA TCATTTCAAA AATGAAGAAT AAACCATATA TTCTTTCTGA GAGTTTTCAG CCTCATGAAC TTTTAGGTAT TTTTAGATGT TTCGACCTAA TAGTTGGTAT GCGTCTTCAT AGTCTCATAT TTGCATCTAA AATGAACAAG AGGTTTGTTG GAATATCGTA CGATCCCAAA ATTGACAGTT TTCTTAAAAT GTATGGTTTA AAACCGGCGG GATATGTTGA TAGTTTTGAT GTAAACAATG TTCTTTTAAA TATCCAGTAT ATGCTCAATG AGTCAAAAAT TCAAAAGAAA ATTGAACAGA TAACAATCAA TATGATTCAA AGAGCGGAAA AGTCTTTTGA GATTTTAAAA GAGGCTTTAT CAACAACTAG AAAAAGAAAA AACATTAATA TTTTGGGTGT GAGAATTGAT TGTATTAATT TTAATAAGGC AAAGAAAAAA TGTATTGACT TTTTATCTTC ATCTTCACCC AAAGTAGTAT TTACACCCAA TGTAGAGATG ATAATGCTAT CTCAAAGGGA CGAGAAATTT AAAAAAATTT TGAATTCGAG TGATTTAAAT GTACCTGACG GAATTGGGGT TGTTTGGGCA TCTAAATATT TTGGAGAAAA ACTGTATGAA AGGGTCACAG GTTTTGACCT GATGATGTCT TTGATGCCAG AGCTTGAGAA AAAAAGAGCA AGAGTATTCT TACTTGGTGC AAAACCAGGG ATTGCTGAAA AAGCGAAAGA AAATCTTTTA AAACAGTTTA AAAATTTAGA AATTTGTGGG ATTTATCATG GATATTTTAG TGAAGAAGAG AATAACACGG TTGTGGAGAT TATAAATTCC TCAAAAGCAG ATGTTTTGTT TGTTGCAATG GGCATGAAAA AACAGGAAGA ATGGATTTAC AAAAATAAAA AGAAACTCAA ATGTAAGCTT ATTATGGGTG TTGGTGGCAG CCTTGATGTT TTGTCAGGTG AGGTAAAAAG AGCACCCAAA ATGTTCCAAA GACTTGGACT TGAATGGTTT TATAGACTGA TAACCCAGCC ATGGAGGTTC AAAAGGATGC TTGCACTTCC TAAATTTGTA TTTGTTGTTT TGAAAAACAG AATATTTGGG GGAAGATAA
|
Protein sequence | MKNIVISGYY GQLNTGDEAI LRVLIDRLRE YERQENEDLN VVVLSSRPQL TSKIYSVDSV NRKKIWEVVK TIKRCDIFIS GGGSLFQNET SNRSLYYYLF QIFLAKLFGK KVFIFSQGIG PIKKWYNILI FKHIIKLADY ITVRDYDSFD LLHRLKLKNK IDLSADPAFL LNPCCEKRVE KLLQTYNIDL SKKTIGIVVR KWKKEKDMTD KIAQIADILI ENEGYNVVFI PFQGKWDIIK INEIISKMKN KPYILSESFQ PHELLGIFRC FDLIVGMRLH SLIFASKMNK RFVGISYDPK IDSFLKMYGL KPAGYVDSFD VNNVLLNIQY MLNESKIQKK IEQITINMIQ RAEKSFEILK EALSTTRKRK NINILGVRID CINFNKAKKK CIDFLSSSSP KVVFTPNVEM IMLSQRDEKF KKILNSSDLN VPDGIGVVWA SKYFGEKLYE RVTGFDLMMS LMPELEKKRA RVFLLGAKPG IAEKAKENLL KQFKNLEICG IYHGYFSEEE NNTVVEIINS SKADVLFVAM GMKKQEEWIY KNKKKLKCKL IMGVGGSLDV LSGEVKRAPK MFQRLGLEWF YRLITQPWRF KRMLALPKFV FVVLKNRIFG GR
|
| |