Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0548 |
Symbol | nuoF |
ID | 3927930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 548735 |
End bp | 550018 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637901670 |
Product | NADH dehydrogenase I subunit F |
Protein accession | YP_507360 |
Protein GI | 88658598 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit |
TIGRFAM ID | [TIGR01959] NADH-quinone oxidoreductase, F subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAAGG ATAGTGATAA AGTATTTACT AATCTTAGTG GTCAATATTC TCCGTTTTTG AGATCTGCAA AAAGTAGGGG TATTTGGAGT AATATTTTAG AAATATTAAA CTTAGGTCCT GAACATATTA TTCAAGAAAT AAAAAAATCA GGATTGCGTG GTAGAGGTGG AGCAGGATTT TCTACAGGAT TGAAATGGAG CTTCATGCCA AAAACTAGAA AAGAAGGACA GCCAGCATAT TTAATAATTA ATGCAGATGA ATCAGAACCT GGTACATGTA AGGACCGTGA TATATTAAGA TATGACCCTC ATAGGCTTAT TGAAGGAATA TTAATAGCTG GTTTTGCAAT GAATGTAACT ACAGCTTATA TTTATATTCG TGGTGAGTTT TATAATGAGT ATTTAGTATT ATCTAAAGCT TTAGAGGAAG CTTATAAAGC TAAGTTAATA GGGAAGAATG CATGTAATTC TGGGTATGAT TTAGATATTT TTATTCATAG AGGTGCTGGA GCATATATTT GTGGTGAGGA AACTGCTCAA CTTGAATCAT TAGAAGGAAG AAAAGGTATG CCAAGATTGA AGCCTCCTTT TCCAGCTGCA ATAGGGTTAT ATGGCTGTCC TACTACTATT AATAATGTTG AAACTATTGC AACTGTTAGT GAGATCATGA GGAGAGGTAG TGATTGGTTT GCATCTCTCG GAAGAGAAAA TAATACTGGT ACTAAGATTT TTTGTATATC AGGTCATGTG AATAACCCAT GTAATGTAGA AGAGGAATTA GGGATTCCCA TGAAGGAGCT TATAGAAAAA TATGCAGGAG GTGTTCGTGG AGGATGGGAT AATTTATTAG CAGTAATACC TGGTGGTTCA TCAGTACCAA TGCTTCCAAA GTCTATTTGT GACACTGTAA ATATGGATTT TGATTCTTTA AGAGCTGTAC AATCAGGGTT AGGTACTGCA GGACTAATAG TTATGGACAA ATCAACAGAT CTAATAGCAG CTATAGAAAG ATTATCACAT TTTTATATGC ATGAATCTTG TGGACAATGT ACACCATGTA GAGAAGGTAC TGGTTGGATG TGGAGAATCA TGAAGAAAAT GGTAAAAGGT GATGCTACAT CTGAAAGCAT TGATTTATTA TTGAATATTA CACATCAAGT AGAAGGGCAT ACTATATGTG CTCTTGGTGA TGCTGCAGCT TGGCCTATAC AAGGATTAAT TAGACATTTT AGAAATGTTA TTGAAGATCG GATAAGGGAT TATAATAATG ATAAATTAAC TTAA
|
Protein sequence | MLKDSDKVFT NLSGQYSPFL RSAKSRGIWS NILEILNLGP EHIIQEIKKS GLRGRGGAGF STGLKWSFMP KTRKEGQPAY LIINADESEP GTCKDRDILR YDPHRLIEGI LIAGFAMNVT TAYIYIRGEF YNEYLVLSKA LEEAYKAKLI GKNACNSGYD LDIFIHRGAG AYICGEETAQ LESLEGRKGM PRLKPPFPAA IGLYGCPTTI NNVETIATVS EIMRRGSDWF ASLGRENNTG TKIFCISGHV NNPCNVEEEL GIPMKELIEK YAGGVRGGWD NLLAVIPGGS SVPMLPKSIC DTVNMDFDSL RAVQSGLGTA GLIVMDKSTD LIAAIERLSH FYMHESCGQC TPCREGTGWM WRIMKKMVKG DATSESIDLL LNITHQVEGH TICALGDAAA WPIQGLIRHF RNVIEDRIRD YNNDKLT
|
| |