Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1603 |
Symbol | |
ID | 6375281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 1729709 |
End bp | 1730827 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642684091 |
Product | NADH dehydrogenase (quinone) |
Protein accession | YP_001960005 |
Protein GI | 189500535 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACCG GAGCTTTTCT GCCAAACAGT TTTCCCATTG TTATGAGTAC AAGCCTCAAC GCCTGGTCTG ATGCTCTCTC TGAGTATTTC CTTTTTGGAC TGCCTGTCGG CCTTGTCATT TTAGCTGCGC TTCCTCTCGT ATTTATTGCG CTCTACGCGT TGACATACGG TGTTTACGGA GAGAGAAAGA TTTCCGCGTT CATGCAGGAC CGGCTTGGCC CGATGGAGGT CGGTAAATGG GGGATTCTTC AGACTCTTGC CGATATTCTG AAACTGCTTC AGAAGGAGGA TATTGTCAAC AAGTCAGCTG ACAAGTTTCT TTTTGTTATC GGCCCCGGAG TTCTTTTTGT CGGTTCATTT CTCGCTTTTG CTGTACTTCC GTTCGGTCCC GCGTTTATCG GTGCCGATCT CAATGTGGGT CTCTTCTATG CCATAGGAAT TGTCGCCCTC GAAGTTGTCG GTATTCTTGC CGCCGGCTGG GGATCAAACA ACAAATGGGC TCTTTACGGA GCTATCCGAA GCGTTGCCCA GATAGTCAGT TATGAGATTC CTGCAGCTAT CGCCATTCTG TGCGGTGTCA TGATGGCAGG GACACTCAGT ATGCAGCAGT TTAATATTCT GCAGCAGGGC GAGTATGGTT TTCTGCACTT TTTCCTTTTC CAGAACCCTA TCGCCTGGCT TCCGTTTCTT ATCTACTTTA TCGCGTCCCT TGCCGAGACA AATCGTGCTC CTTTTGATAT ACCTGAAGCT GAATCCGAGC TTGTTGCCGG TTATTTCACA GAGTACAGCG GTATGAAATT CGCTGTGATC TTTCTTGCGG AATATGCCAG TATGTTTATG GTTTCAGCGA TCATTTCAAT TGTTTTTCTG GGAGGCTGGA ATTCACCGTT TCCCAATATC GGTCCGCTGT TGCTTAATGA CTGGACAACC GGTCCCGTAT GGGGGGCATT CTGGATCATC ATGAAAGGTT TCTTCTTCAT TTTTATCCAG ATGTGGCTCA GATGGACGCT TCCAAGACTG AGAGTTGATC AGCTGATGCA TGTCTGCTGG AAAGTGTTGA CCCCGTTTGC TTTTGTGGCA TTCGTTCTGA CGGCGATATG GGAGATTTAT GTCAAATAG
|
Protein sequence | MSTGAFLPNS FPIVMSTSLN AWSDALSEYF LFGLPVGLVI LAALPLVFIA LYALTYGVYG ERKISAFMQD RLGPMEVGKW GILQTLADIL KLLQKEDIVN KSADKFLFVI GPGVLFVGSF LAFAVLPFGP AFIGADLNVG LFYAIGIVAL EVVGILAAGW GSNNKWALYG AIRSVAQIVS YEIPAAIAIL CGVMMAGTLS MQQFNILQQG EYGFLHFFLF QNPIAWLPFL IYFIASLAET NRAPFDIPEA ESELVAGYFT EYSGMKFAVI FLAEYASMFM VSAIISIVFL GGWNSPFPNI GPLLLNDWTT GPVWGAFWII MKGFFFIFIQ MWLRWTLPRL RVDQLMHVCW KVLTPFAFVA FVLTAIWEIY VK
|
| |