Gene Information |
Locus tag | Acid345_1290 |
Symbol | |
ID | 4071362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1567755 |
End bp | 1569035 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637983299 |
Product | respiratory-chain NADH dehydrogenase, subunit 1 |
Protein accession | YP_590366 |
Protein GI | 94968318 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
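The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon (assumed to be counted in the gene length) encodes no residue.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(1567755, 1569035)
print(length)                     # 1281
print(protein_length_aa(length))  # 426
```

Under those assumptions the values agree with the Gene Length (1281 bp) and Protein Length (426 aa) rows of this record.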
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.673572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCTCAC ACTTTCTGAA CTTTCTGAAA TTCAATGGAA CGCCGGGCGA ATACGGCAGC CCGCTGTGGG CCACGGTTTA CATCCTCGTG ATCTTTGGAG TGGCGTCAGT GGCGGTGATG CTGATGACCT ACCTGGAGCG CAAGGTGCTG GCGCACATGC AGATTCGCCT CGGCCCCATG CGCGTAGGTC CGCACGGATT GCTGCAGCCG ATTGCTGATG CGCTGAAGCT GCTTATCAAA GAAGACATCG TTCCCGACGG CGCCGACAAG TTCCTGTTCT GGATGGCTCC GGTCACGGTG ATGATGACAG CGTTCACCAC GTACCTAGTA ATCCCGTTTG GACGCAGCCA TGCAGTGACG GACATGAACA TCGGTGTCTT GTTCATGATC GGCATCTCGT CGCTCGGCGT GCTGGCGGTG GTGATGGCGG GGTGGTCGTC GAACTCGAAG TACGCGTTGA TGGGCGGGCT CCGTTCGGCG GCACAGATGG TGAGCTACGA AGTGGCGATG GGGCTGGCGA TTGTCAGCGT GTTGATGATG ACCTCGCTGC AGACCGGCAC TGGAACGCTG AGCATGATCG GCATTGTGCA GGCACAACAA GCGCAGGGTT CGTGGTTCAT CTTTAAGTTC TTTCCGACCG GGCTGGTGGC GTTCGTGATC TTCGCGATCG CGATGGTCGC TGAGACGAAT CGCGCACCGT TCGATCTGCC GGAAGCTGAA AGCGAATTGA CTGCCGGCTT CCACACCGAA TACAGCGGGT TCCGCTGGTC GTTGTTCTTC CTCGGCGAAT ACGTGGCGAT GATCGCGGTT TCGTCGATCG CGGTAACGCT CTGGCTCGGC GGATGGCTGC GGCCGTTCCC GAATGCGCTG AGCGGCGCTA CGTGGGACTT CGCATTTTCG GTCTTCCCGG CGCTGTTGTT CTTCGTGCTG GCCGCGGGAT GTTTTATCGG ATGGGTTCGC ATGCCGAGCA AGCCGGCGTT CAAGGTCCAA GCGATCGGGC TCGGTATCTT CGGCGTGCTG CTGGGGATGA TCGGCGCGGT GCTGCTGATT CCCGCAGTGC GAGTACGGGT GAGCGACATC TTCTGGTTCT CGGCGAAGGT CGGCGTCTTT ATGTACCTCT ACATCTGGTA TCGCGGGACG TTCCCCCGGT ATCGCTTCGA CCAGCTGATG AAGATTGGCT GGAAGGTGCT GCTGCCGGTA TCGCTCGGGG TGCTGATTGT GACTGCGGTA CTCGGCGTAC GGCATGAGCT GATCGCTGGA TTGATGGGGG TGGCGCGATG A
|
Protein sequence | MISHFLNFLK FNGTPGEYGS PLWATVYILV IFGVASVAVM LMTYLERKVL AHMQIRLGPM RVGPHGLLQP IADALKLLIK EDIVPDGADK FLFWMAPVTV MMTAFTTYLV IPFGRSHAVT DMNIGVLFMI GISSLGVLAV VMAGWSSNSK YALMGGLRSA AQMVSYEVAM GLAIVSVLMM TSLQTGTGTL SMIGIVQAQQ AQGSWFIFKF FPTGLVAFVI FAIAMVAETN RAPFDLPEAE SELTAGFHTE YSGFRWSLFF LGEYVAMIAV SSIAVTLWLG GWLRPFPNAL SGATWDFAFS VFPALLFFVL AAGCFIGWVR MPSKPAFKVQ AIGLGIFGVL LGMIGAVLLI PAVRVRVSDI FWFSAKVGVF MYLYIWYRGT FPRYRFDQLM KIGWKVLLPV SLGVLIVTAV LGVRHELIAG LMGVAR
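The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table for NCBI translation table 11 (the value in the Translation table row) and assumes that an initiating GTG is read as methionine, which would explain why the protein starts with M although the gene starts with GTG. The function name `translate_cds` is hypothetical, and this is not the pipeline that produced the record.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Codons are enumerated in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed for table 11; at position 0 each is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still initiate with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGATCTCAC ACTTTCTGAA CTTTCTGAAA"))  # MISHFLNFLK
```

The output matches the first block of the protein sequence above. Passing the full gene sequence, spaces included, should work the same way, since whitespace is stripped before translation.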