Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6087 |
Symbol | |
ID | 5674408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7409281 |
End bp | 7410669 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641244939 |
Product | NADH dehydrogenase subunit H |
Protein accession | YP_001510337 |
Protein GI | 158317829 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0085721 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0109132 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGGA CCTCGACGCT CCTGCTCGCC GCGAACGTCG GCGACCCGGA CATGTCCGTC CTCACCGACG ACCCGTTCTG GCTCATCCTG ATCAAGGCTG TCGCGGTCTT CGCGTTCCTG CTGGTGATGA CGCTGTTCTC GATCGTCTTC GAGCGCAAGG TCGTCGCGAA GATGCAGCAG CGGGTCGGGC CGAACCGGCA CGGCCCGCGC GGCTGGCTGC AGAGCCTCGC CGACGGCGTG AAGCTCATGC TCAAGGAAGA CATCATCCCG ACGCTGGCCG ACAAGCCGAT CTTCATCCTG GCTCCGGTCA TCTCGGCGGT GCCGGCCATC CTGGCCTTCG CCTGCATCCC GTTCGGGCCG GAGGTGTCGA TCTTCGGTGA ACGCACGACG CTCCAGCTCG CCGACCTGCC GGTCAGCGTG CTGTACCTGC TGGCGGCGGC GTCGCTGGGC GTGTACGGAC TGATCCTCGC CGGCTGGTCC AGCGGCTCGA CCTACCCGCT GCTGGGCTCG CTGCGCTCGG CCGCGCAGAT CATCTCCTAC GAAGTCGCGA TGGGCCTGGC CTTCGTCGCG GTCTTCGTCT ACGCGGGAAC ACTGTCGACC GCGGGCATCG TGCAGAGCCA GCACGACTGG TGGTACATCG CGCTGCTGCC GTCGTTCATC CTCTACTGCA TCGCGATGGT CGGTGAGACG AACCGGACGC CGTTCGACCT CCCCGAGGCC GAGGGCGAGC TGGTCGGCGG GTTCCACACC GAGTACAGCT CGATCAAGTT CGCGTTCTTC TTCCTCGCCG AGTACATCAA CATGGTCACC GTCTCGGCGA TCGCGACCAC ACTGTTCCTC GGAGGCTGGC AGCCGCCGCC GATCCCGGGC CTGTCCGGGC TGGACCACGG CTGGTACCCG CTGATCTGGT TCGTCATCAA GCTGCTGCTG TTCATCTTCG TGTTCATCTG GCTGCGGGGC ACGCTGCCGC GGCTGCGCTA CGACCAGTTC ATGGCCTTCG GCTGGAAGGT CCTGATCCCG GTCGGCCTGG TGTGGGTGCT CGCGGTCGCG ACCTTCCGCG CCTACCAGGA GCACGTGAGT GACCGGACGC CATGGCTGAT CGGCTTCGGG GTCGTGGTGG GCGTCCTGCT CGTGGTTGCG ATCATCGACC CGGGCGCGGC GAGGGCCCAG CGCGAGCAGG AGGAGGCCGA GCGGGAACGC GCCGAATCGG CCCCGAGCCT GGACAGGATT CCCTGGCCGC CGCCGTCCGA CGGCGCGACG AGGGCACTGG CCGGCCGCAC TGCGGCTGGC AGCACTGCAG CCGGCAGCGG CGCGGCCGGC TCCGCCGGCG GCGACAAGGG AAACACCACC GTCATCCCCG CGGGCTCCGG CCCGCGACAG GAGAGCTGA
|
Protein sequence | MTGTSTLLLA ANVGDPDMSV LTDDPFWLIL IKAVAVFAFL LVMTLFSIVF ERKVVAKMQQ RVGPNRHGPR GWLQSLADGV KLMLKEDIIP TLADKPIFIL APVISAVPAI LAFACIPFGP EVSIFGERTT LQLADLPVSV LYLLAAASLG VYGLILAGWS SGSTYPLLGS LRSAAQIISY EVAMGLAFVA VFVYAGTLST AGIVQSQHDW WYIALLPSFI LYCIAMVGET NRTPFDLPEA EGELVGGFHT EYSSIKFAFF FLAEYINMVT VSAIATTLFL GGWQPPPIPG LSGLDHGWYP LIWFVIKLLL FIFVFIWLRG TLPRLRYDQF MAFGWKVLIP VGLVWVLAVA TFRAYQEHVS DRTPWLIGFG VVVGVLLVVA IIDPGAARAQ REQEEAERER AESAPSLDRI PWPPPSDGAT RALAGRTAAG STAAGSGAAG SAGGDKGNTT VIPAGSGPRQ ES
|
| |