Gene Information |
Locus tag | Athe_1917 |
Symbol | |
ID | 7407330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2022362 |
End bp | 2023282 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643716289 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_002573778 |
Protein GI | 222529896 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
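The coordinate and length fields above are consistent with one another if the start and end positions are read as 1-based and inclusive, and if the protein length excludes the terminal stop codon. The following minimal sketch checks that arithmetic; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(2022362, 2023282)  # Start bp, End bp from the record
    print(length, protein_length(length))
```

With the values in this record, the sketch gives 921 bp and 306 aa, matching the Gene Length and Protein Length fields.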
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000794188 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAAGAT ACATACTCAA AAGGATAGTG TGGTCTATTG TATCATTATT CGTCATAGTT ACTGTTACAT TTTTTCTTAT GAGAATGATA CCAGGTGGTC CATTCACGGG TGAAAAGACT TTGCCTGAGC AGATTTTGCA AAACCTGAAC GAGAAATATG GACTTAACAA ACCGCTTGGA GTGCAATATT TCAAATATTT AAATAGCCTT TTACACGGTG ATTTGGGAAT TTCAATGAGA AATCAAGGTA GAACAGTTAA TGAGATTATT GCAGAAACGT TTCCCATTTC TGCTAAGGTG GGTATTTTGG CTATAATTTT GAGTTTGCTG ATAGGGATAC CGCTTGGTAT CTGGTCTGCC GTACATCAAG GTAAATGGCA GGATAATTTG TCTATGATTA TAGCAACCAT TTTCATTACG ATACCTGGAT TTGTACTTGC TGTAATTTTA ATGTATATCT TTGGTGTAAA GCTTCAACTT GTACCTATAA TGGGATTAGA TGAACCTAAA AGCTATGTTC TTCCTGTTGT TACACTGGCA GCATATCCAA TATCTTTTAT TGCAAGGCTT ATTCGAAGCA GTATGCTTGA AAGTTTATCA CAGGACTATA TTAGAACTGC ACGCGCAAAA GGACTTTCAG ATTTCATAGT CATATACAAA CATGCGCTGA AAAATTCTTT GATACCTGTT GTTACGTATT TGGGTCCTTT AATTGCAGGT ATACTTACTG GTAGTTTTGT TGTTGAAAAG ATTTTCTCAA TCCCAGGAAT GGGGAGGTTC TATGTTGATA GTATATCTAA CAGGGACTAT TCGCTTGTGA TGGGAACCAC AATATTTTAT GCAGCATTTT TGATATTTAT GAACCTAATT GTTGACATTA TCTATGTATT TATAGACCCG CGTATAAAAC TTGAGGACTG A
|
Protein sequence | MARYILKRIV WSIVSLFVIV TVTFFLMRMI PGGPFTGEKT LPEQILQNLN EKYGLNKPLG VQYFKYLNSL LHGDLGISMR NQGRTVNEII AETFPISAKV GILAIILSLL IGIPLGIWSA VHQGKWQDNL SMIIATIFIT IPGFVLAVIL MYIFGVKLQL VPIMGLDEPK SYVLPVVTLA AYPISFIARL IRSSMLESLS QDYIRTARAK GLSDFIVIYK HALKNSLIPV VTYLGPLIAG ILTGSFVVEK IFSIPGMGRF YVDSISNRDY SLVMGTTIFY AAFLIFMNLI VDIIYVFIDP RIKLED
|
| |
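The sequence fields are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the coding sequence. It assumes the gene sequence is already given in coding orientation (it begins with GTG and ends with TGA), uses the standard codon assignments that translation table 11 shares with table 1, and treats only ATG, GTG, and TTG as start codons, which is a simplification. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order (first base varies slowest).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Simplification: only the three most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove whitespace between the 10-character blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate a CDS; a recognised start codon is read as M, stop ends it."""
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is decoded as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the Gene sequence field above.
    head = "GTGGCAAGAT ACATACTCAA AAGGATAGTG"
    print(translate(head))  # first ten residues of the protein
```

Applied to the first three blocks of the gene sequence, `translate` returns `MARYILKRIV`, which matches the start of the Protein sequence field. Passing the full Gene sequence value to `gc_percent` and `translate` allows the GC content and Protein Length fields to be checked in the same way.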