Gene Information |
Locus tag | Cthe_2802 |
Symbol | |
ID | 4809639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3303221 |
End bp | 3304204 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108222 |
Product | NLPA lipoprotein |
Protein accession | YP_001039194 |
Protein GI | 125975284 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
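The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3303221
end_bp = 3304204

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not
# part of the translated protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 984 bp
print(protein_length, "aa")  # 327 aa
```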
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000138117 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA AAGCGGCAAT TTTCATATTA CTGGTTTTGA TTATATTAAG CTTGGCAGGC TGCAGTAATG GAGACAGGGC GGCCGGAACG GCAAATAACG GGACAAATGC CAATACGGAA GTCAAAACGG TAAAAATTGC TTATCTGCCT ATTACCCATG CTCTTCCGCT TTATGTGGAA AATGAACTTG CAAATGAAAA CTTTAAAAAT TTTAAACTGG AGCTTGTAAA GTTTGGTTCG TGGACGGAAC TGGTGGATGC TTTGAATTCA GGAAAAGTGG ACGGTGCGTC CATGCTTATA GAACTTGCAA TGAAAGCAAA GGAGCAGGGG ATTGATTTAA AAGCGGTTGC CTTGGGTCAC AGAGACGGAA ATGTGGTGGT GGTATCCAAG GATATCAATA AAGTTGAAGA TTTGAAAGGA AAAAGCTTTG CCATACCAAG CAAGCTTTCA ACTCATAATA TTCTCTTACA TATTATGCTG AAAAACCATG GCCTTGCATA TAACGATGTA AATGTTGTTG AGCTTCCACC GCCGGAAATG GCGGCCGCTC TTGCGGAAGG CAGGATATCC GGCTATTGTG TGGCTGAGCC TTTTGGAGCA AAATCGGTGG CAGTGGATAA AGGTAAGACC TTGTTTGAGT CCCAGGATTT GTGGGAAGGT TCTGTGTGCT GCGGATTGGT TCTTAGAAAT GATTTTATCA AAAATAACGA GGCTATAGCG GAGGAATTTA TCAAAGAATA CATAAAAGCA GGGGAAAAAG CTGAAGCAAA AGATGAGACA ATCCGGGATA TTGCCACAAA ATATCTAAAA GCGGAGGAAC AAGTGCTGGA TTTGTCTCTT AAATGGATTT CCTATGAAAA CTTGAAACTT GAAGAAAAGG ATTACAATGA GCTTGCAGAA TACATGGTGG AAATGGGACT TTCCGAAAAT CCTCCGAAGT ACGACGAGTT TGTGGATAAT ACATTTATAG GTAAAGTGAA GTGA
|
Protein sequence | MKRKAAIFIL LVLIILSLAG CSNGDRAAGT ANNGTNANTE VKTVKIAYLP ITHALPLYVE NELANENFKN FKLELVKFGS WTELVDALNS GKVDGASMLI ELAMKAKEQG IDLKAVALGH RDGNVVVVSK DINKVEDLKG KSFAIPSKLS THNILLHIML KNHGLAYNDV NVVELPPPEM AAALAEGRIS GYCVAEPFGA KSVAVDKGKT LFESQDLWEG SVCCGLVLRN DFIKNNEAIA EEFIKEYIKA GEKAEAKDET IRDIATKYLK AEEQVLDLSL KWISYENLKL EEKDYNELAE YMVEMGLSEN PPKYDEFVDN TFIGKVK
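The protein sequence follows from the gene sequence under translation table 11, and the GC content is the fraction of G and C bases. The following is a minimal sketch of both calculations using only the Python standard library. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, the codon table is built from the amino-acid string that NCBI lists for table 11 (identical to the standard code for elongation codons), and alternative start codons are not handled. For brevity it runs on the first 30 bases of the gene rather than the full sequence.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position), paired with
# the amino-acid string listed for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the bases."""
    return "".join(seq.split()).upper()

def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks of the gene sequence in this record.
fragment = "ATGAAAAGAA AAGCGGCAAT TTTCATATTA"
print(translate(fragment))  # MKRKAAIFIL
print(f"{gc_content(fragment):.1%}")
```

Passing the full gene sequence to the same functions should give a protein to compare against the Protein sequence field and a GC fraction to compare against the reported 41%.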