Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0710 |
Symbol | |
ID | 5707914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 789210 |
End bp | 790139 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641270228 |
Product | respiratory-chain NADH dehydrogenase subunit 1 |
Protein accession | YP_001535620 |
Protein GI | 159036367 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.98531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000156681 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCTGATA CACCGGTGGC GGTGTCCGGT GGCTGGGCGC CGGTCGCGGC GGGGCTGCTG GGGACTCTCG CGGTTGCTGC GGCGCTACTC GACGGCACCC TGGCTGGGCG GGCGACCGGC GCCAGATCCG CGATGGGGCG GCCGGTGGGT GAGGTGGCGC GGTTGCTGCG GCAGCGGCGT CGGACCACGG TCGCCGCGGA CCGCCTGCTG TGGCGGGTCG GCGGGGCCGG GCTGCTCGTG ATGGCGCTGA TGATCGTCAC GGTGGTGCCG CTCGGGCGGT GGACCCTGTT CGACCTGGAC GTCGGGGTGG TCTGGGTCAA CGCGCTCGAT GTCCTGGCCT GGGCCTTCGT GTGGCTGGCC GGGTGGGGTG CCAACTCCGC GTACTCGCTC GTCGGTGGCT ACCGGTTCCT GGCGCACGGG TTGGCGTACG AGCTGCCGCT GATGTTCGCC CTGGTGGCGC CCGCGGTTGC GGCGTCGAGC TTGCGAGTGG GGGAGGTCGC TGCCGCGCAG CAGGGCCTGT GGTTCGTGGT GTGGATGCCG GTGGCCTTCG TCGTCTACTG CCTCGGCGTG GTGGCCATGG CGGTGTGGGG GCCGTTCTCA CCAGCGCTCG GTGACGATGT GGCGGGTGGG GTGACCGTGG AGCTGTCCGG CGTGGATCGG CTGTTGTTCC TGGCCGGCCG CTACGCGCTA CTGGCGGCGG GGGCGGCCTT CGCCGTACCG ATGTTCCTGG GCGGCGGTGC CGGGCCGCTG CTGCCCGCCT GGGCCTGGGT GTTGGTGAAG GCCTCGGTGC TGCTGGCGCT GCTGGTGTGG CTGCGCCGCC GGGTACCCGC ACTGCGTCCG GACGTGTTCA TGGAGGTGGG CTGGCTGGTG CTGCTGCCGG CGGTGCTGGC GCAGGACCTC CTCGTCGCCG TCCTCGTGAT CGGGAGCTGA
|
Protein sequence | MADTPVAVSG GWAPVAAGLL GTLAVAAALL DGTLAGRATG ARSAMGRPVG EVARLLRQRR RTTVAADRLL WRVGGAGLLV MALMIVTVVP LGRWTLFDLD VGVVWVNALD VLAWAFVWLA GWGANSAYSL VGGYRFLAHG LAYELPLMFA LVAPAVAASS LRVGEVAAAQ QGLWFVVWMP VAFVVYCLGV VAMAVWGPFS PALGDDVAGG VTVELSGVDR LLFLAGRYAL LAAGAAFAVP MFLGGGAGPL LPAWAWVLVK ASVLLALLVW LRRRVPALRP DVFMEVGWLV LLPAVLAQDL LVAVLVIGS
|
| |