Gene Information |
Locus tag | Francci3_4478 |
Symbol | |
ID | 3907454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 5349796 |
End bp | 5351379 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637881810 |
Product | nitrogenase cofactor biosynthesis protein NifB |
Protein accession | YP_483553 |
Protein GI | 86743153 |
COG category | [R] General function prediction only |
COG ID | [COG0535] Predicted Fe-S oxidoreductases |
TIGRFAM ID | [TIGR01290] nitrogenase cofactor biosynthesis protein NifB |
|
|
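The coordinate and length fields above are related by simple arithmetic. A minimal sketch of that consistency check, assuming Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API:

```python
# Values copied from the Gene Information table.
START_BP = 5349796
END_BP = 5351379
GENE_LENGTH_BP = 1584
PROTEIN_LENGTH_AA = 527


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The strand (`-`) does not affect either calculation, since both depend only on the span of the feature.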
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0277889 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGCAA CCCCCGTCAG CCTGCCGATG CTGCCCACCG ACCCGGCAGG CGAGATGCCC CGGCAGGCCA CCGGGCCGGC GCCCGACACC GCCGCCGGCT GCGCGAGCAA GTCCAGCTGT GGCACCAGCG CTCCGGTGCG GGACCCGGAG ATCGCGGAGA AGATCGCCAA TCATCCCTGT TACAGCGCCG AGGCCCACCA GTACTACGCC CGGATGCACG TGGCGGTCGC CCCAGGTTGC AACATCCAGT GCAACTACTG CAACCGCAAG TTCGACTGCG CCAACGAGAG CCGTCCCGGC GTCACCAGCA CCCTGCTGTC CCCCGAGGAC GCCCTCGCCA AGGTCAAGCT GGTGGCCAGC GAGATCAAGC AGATGAGCGT GCTGGGGATC GCCGGCCCCG GCGACCCGCT GGCGAACCCG AAGCCGACCT TCCGGACGAT GGAACTGGTG GCCCGGGACT GCCCGGACAT CAAGCTCTGC CTGTCTACCA ACGGGCTGAC CCTGCCGGAC CACGTCGACC GCATCGCCGA ACTGAACGTC GACCACGTCA CCATCACGAT CAACATGATC GACCCCGAGG TCGGGGAGCG GATCTACCCG TGGATCGCCT TCCGCGGTAA GCGGTACACC GGCCGGGAGG CGTCCCGGAT CCTCTCCGAG CGTCAGCTCG AGGGCCTGGC GATGCTCACC GAGCGGAAGA TTCTCTGCAA GGTCAACTCG GTGATGATCC CCGGGATCAA CGATGACCAC CTCGTCGAGG TCTCCCGGAA GGTCAAGGAG CTCGGCGCCT TCCTGCACAA CGTGATGCCG CTGGTGTCGG CGCCCGAGCA CGGCACCCAC TTCGGCCTGA CCGGGCAGCG CGGCCCCACC CCACAGGAGC TCAAGGCGCT GCAGGACCGC TGCGAGCAGG ACGACGGCGC CGAGATGAAC ATGATGCGGC ACTGCCGGCA GTGCCGCGCC GACGCCGTCG GCCTGCTCGG TGAGGACCGG GGTGAGGAGT TCACTCCCGA GGCGTTCCGT GGCCGTGAGA TCGAATACGA CCTCGAAGGC CGCCGGCAGA CGCATAGCGA GATCGAACGG TGGCGCTCCG AGGTGGCTGC CACCCGGGGA GCGCTGAACA TCTCGACCGG CGCGGTCATT CCCTCTGGCC CGGGCGCATC CCCCGATGAG GGCACCCCGG CCGGGGCCCG CCCGGGGAAC GCCCGCCCGG AGAACGTCGT GCTCGTCGCT GTAGCGACCA AGGGCAGCGG CGTGGTGAAC CAGCACTTCG GTCACGCGAC CGAGTTCTGG ATCTACGAGG GGGGTCCGGG CTGGGCCCGG CTCGTGCAGA CCCGCGACGT GGACCGCTAC TGCAACGGCC CGTCGGACTG CGACGAGGAC GCCTCCAAGC TCGACAAGAC GGTCGCGATG CTGTCCGACT GCGCGGCGGT GTTGTGCAGC AAGATCGGCC TCGGGCCGCG CGAGGCGCTC GAGAATGCCG GGATCGAACC GGTGGAGCTC TACGACCTGA TCGAGAAGGC GGTGGCCGAG GTCGGCTCCC GCCTTGTCGC ACATCGTGCC GAAGCGGAGG TTGCCGTCCG ATGA
|
Protein sequence | MKATPVSLPM LPTDPAGEMP RQATGPAPDT AAGCASKSSC GTSAPVRDPE IAEKIANHPC YSAEAHQYYA RMHVAVAPGC NIQCNYCNRK FDCANESRPG VTSTLLSPED ALAKVKLVAS EIKQMSVLGI AGPGDPLANP KPTFRTMELV ARDCPDIKLC LSTNGLTLPD HVDRIAELNV DHVTITINMI DPEVGERIYP WIAFRGKRYT GREASRILSE RQLEGLAMLT ERKILCKVNS VMIPGINDDH LVEVSRKVKE LGAFLHNVMP LVSAPEHGTH FGLTGQRGPT PQELKALQDR CEQDDGAEMN MMRHCRQCRA DAVGLLGEDR GEEFTPEAFR GREIEYDLEG RRQTHSEIER WRSEVAATRG ALNISTGAVI PSGPGASPDE GTPAGARPGN ARPENVVLVA VATKGSGVVN QHFGHATEFW IYEGGPGWAR LVQTRDVDRY CNGPSDCDED ASKLDKTVAM LSDCAAVLCS KIGLGPREAL ENAGIEPVEL YDLIEKAVAE VGSRLVAHRA EAEVAVR
|
| |
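The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. A sketch under stated assumptions: the gene sequence is written 5' to 3' on the coding strand; translation table 11 uses the standard codon-to-amino-acid assignments; and an initiation codon in the first position (here GTG) is read as M. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and the start codon set is listed from memory of the NCBI table 11 definition.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumed initiation codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]  # raises KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First seven codons of the gene sequence above.
print(translate("GTGAAGGCAA CCCCCGTCAG C"))  # MKATPVS
```

Applied to the full Gene sequence field, `translate` should reproduce the Protein sequence field, and `round(100 * gc_fraction(...))` can be compared with the reported GC content.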