Gene Information |
Locus tag | Franean1_6880 |
Symbol | |
ID | 5675193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8384567 |
End bp | 8386027 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641245729 |
Product | nitrogenase molybdenum-iron protein alpha chain |
Protein accession | YP_001511120 |
Protein GI | 158318612 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01282] nitrogenase molybdenum-iron protein alpha chain; [TIGR01862] nitrogenase component I, alpha chain |
|
|
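The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration and assumes the Start bp and End bp values are 1-based and inclusive, which matches the stated Gene Length. It also assumes an unspliced CDS that ends in one stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 8384567
END_BP = 8386027


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1461
print(protein_length(length_bp))  # 486
```

Both results agree with the Gene Length (1461 bp) and Protein Length (486 aa) fields in the record.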
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.782368 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGA CTCCGGCTCC GCAGCGGGCC GAGACCGAGG CGATGATCGC CGAGGTCCTG GAGCAGTACC CCCAGAAGGC GGCCAAGTTC CGCGCCAAGC ACCTCAAGGC CAACGACCCC GAGGGGTCGA AGGAGTGCGA GGTCAAGTCC AACATCAAGT CCCGCCCCGG GGTCATGACG ATCCGTGGCT GTGCCTACGC CGGTTCCAAG GGCGTGGTGT GGGGCCCGGT GAAGGACCTG GTCACGATCA GCCACGGCCC GGTCGGCTGC GGCCAGTACT CCTGGGCCAC CCGGCGCAAC TACGCCCGCG GTCCGTGGGG TGTCACCAAC TTCACCGCGA TGCAGATCAC CACGGACTTC CAGGAGAAGG ACATCGTCTT CGGCGGTGAC CCCAAGCTGG AGCAGGTCTG CGACGAGATC AACGAGCTCT TCCCGCTGGC CAAGGGCATC TCGGTCCAGT CCGAGTGCCC GATCGGTCTG ATCGGCGACG ACATCGAGGC GGTTTCCCGC ACGGCGTCGA AGAAGCTTGG CAAGCCGGTC ATCCCGGTCC GCTGTGAGGG CTTCCGCGGG GTGAGCCAGT CGCTCGGCCA CCACATCGCC AACGACGCGG TCCGCGACCA CGTTCTCGGC ACCGGCGGCG AGTCCTTCAA GGAGACCCCG TACGACGTCG CGCTCATCGG CGACTACAAC ATCGGCGGCG ACGCCTGGGC GTCCCGGCGG ATCCTCGAGG ACATGGGCCT GCGCGTCATC GCCCAGTGGT CCGGCGACGG CACGCTGAAC GAGATGGCCT CGACGCACCT GTCGAAGCTG AACCTGATCC ACTGCTACCG CTCCATGAAC TACATCTGCA CCACCATGGA GGAGCGTTAC GGCACTCCGT GGATCGAGTT CAACTTCTTC GGGCCGACCA AGATCGTCAA CTCGATGCGG GCCATCGCGG CCAAGTTCGA CGAGACGATC CAGGCGAAGA CCGAGGCCGC GATCGCGCGT TACCAGAAGC GCTTCGACGA GATCACGGCG GCGTTCAAGC CGCGCCTCGA CGGCAAGCGC GTCATGCTCG CTGTCGGCGG CCTGCGTCCC CGTCACACCA TCGGCGCCTA CGAGGACCTC GGCATGGAGG TCGTCGGCAC CGGTTACGAG TTCGCGCACA AGGACGACTA CACCCGCACG TACGCCGAGC TCAAGGAAGG CGTCGTGCTC TACGACGACC CGACGGCGTT CGAGCTGGAG GAGTTCGCCA AGCGGCTCAA GCCGGACCTC ATGGGTGCGG GCGTCAAGGA GAAGTACGTC TTCCACAAGA TGGGCATCCC CTTCCGCCAG ATGCACTCCT GGGACTACTC CGGGCCGTAC CACGGCGTCG ACGGCTTCGC GGTCTTCGCC CGCGACATGG ACATCGCGAT CAACAGCCCG ACCTGGGACC TCATGGAGAC CCCCTGGTCG AAGGCCGGAG AGGTGTTCTG A
|
Protein sequence | MTTTPAPQRA ETEAMIAEVL EQYPQKAAKF RAKHLKANDP EGSKECEVKS NIKSRPGVMT IRGCAYAGSK GVVWGPVKDL VTISHGPVGC GQYSWATRRN YARGPWGVTN FTAMQITTDF QEKDIVFGGD PKLEQVCDEI NELFPLAKGI SVQSECPIGL IGDDIEAVSR TASKKLGKPV IPVRCEGFRG VSQSLGHHIA NDAVRDHVLG TGGESFKETP YDVALIGDYN IGGDAWASRR ILEDMGLRVI AQWSGDGTLN EMASTHLSKL NLIHCYRSMN YICTTMEERY GTPWIEFNFF GPTKIVNSMR AIAAKFDETI QAKTEAAIAR YQKRFDEITA AFKPRLDGKR VMLAVGGLRP RHTIGAYEDL GMEVVGTGYE FAHKDDYTRT YAELKEGVVL YDDPTAFELE EFAKRLKPDL MGAGVKEKYV FHKMGIPFRQ MHSWDYSGPY HGVDGFAVFA RDMDIAINSP TWDLMETPWS KAGEVF
|
| |
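The sequences above are displayed in space-separated groups of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch, using hypothetical helper names (`clean_sequence`, `gc_percent`), of how the display grouping might be removed and a GC percentage computed. It is shown on a short fragment only, not on the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First three 10-base groups of the Gene sequence field, as displayed above.
fragment = clean_sequence("ATGACGACGA CTCCGGCTCC GCAGCGGGCC")
print(len(fragment))   # 30
print(fragment[:3])    # ATG
```

Applied to the full Gene sequence field, the output of `gc_percent` can be compared with the GC content value listed in the Gene Information table.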