Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1941 |
Symbol | |
ID | 7268857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 2375494 |
End bp | 2376441 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643566779 |
Product | NMT1/THI5 like domain protein |
Protein accession | YP_002463272 |
Protein GI | 219848839 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.876036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000270678 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAACAC CTATTACCTT AGCCCTCGAT TGGACACCCA ACACAAATCA TATTGGGTTC TACGTAGCTA TTGCCAAGGG ATGGTACCGT GACGCCGGAA TCGAACCGAT CATGCTGTCC CCGGAAGAGG ACAATTATCA GACGACACCG GCTGCGAAGG TGGTAGCAGG AAGAGCGTTA TTGGCAATTG CTCCCTCGGA GAGTGCTTTG AGTTACCATC TTCACCCCAC CAAACCATCG TTGGTTGCCA TTGCAGCGCT AGCACAACGC GACACAAGTG CGATTGTGAC ATTAGCCAAC AGCGGCATTG ATCGACCGGC CAAACTTGAT GGTCGTCGCT ACGCTTCGTA CAACGCACGA TTTGAGCGCG CAATCGTAGC GCAGATGATC CGCAACGACG GAGGGAAGGG CGAATTCGAT GAGATCTTTC CACCCAAACT CGGCATCTGG GAAACATTGC TCACCAGCGT TGCCGATGCA ACATGGGTCT TTATGCCGTG GGAAGGGGTG CAAGCCCGTC GAGCCGGGAT CGCTCTCAAC GCCTTCCACC TCGACGACTA CGGCATTCCT TACGGCTACA CGCCGATATT GTTGGCCCAC CCGGATGCAC TCCGCACGCA TCCAGATGCC CTGCGCGCAT TATTGAATGC CACTGCCGAG GGCTACCGCT TCGCCGTTCA TCATCCCGAT GAAGCCGTGG CAGCACTTAT CACGGAGGCT AAGCACCCGA GCTTGCAGGA TCGCGATTTT GTGACCGAAA GTTTGTATGA ACTCGCCCCC GCTCTGCTGA CCGCTGATGG TCGGTGGGGG GTGATGGACG GCCAGCGGTG GCAAGCCTTT GTGACGTGGC TCGACCAACA AGGTCTGATT GTGGATCGGA ATGGACAGCG CATCCCGTTA GCACCAGATA CATACCTTGC TCTGTTTACA AATGAGCTTT GGAACTAG
|
Protein sequence | MSTPITLALD WTPNTNHIGF YVAIAKGWYR DAGIEPIMLS PEEDNYQTTP AAKVVAGRAL LAIAPSESAL SYHLHPTKPS LVAIAALAQR DTSAIVTLAN SGIDRPAKLD GRRYASYNAR FERAIVAQMI RNDGGKGEFD EIFPPKLGIW ETLLTSVADA TWVFMPWEGV QARRAGIALN AFHLDDYGIP YGYTPILLAH PDALRTHPDA LRALLNATAE GYRFAVHHPD EAVAALITEA KHPSLQDRDF VTESLYELAP ALLTADGRWG VMDGQRWQAF VTWLDQQGLI VDRNGQRIPL APDTYLALFT NELWN
|
| |