Gene Francci3_4478 details

Gene Information

Locus tag: Francci3_4478
Symbol:
ID: 3907454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5349796
End bp: 5351379
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 69%
IMG OID: 637881810
Product: nitrogenase cofactor biosynthesis protein NifB
Protein accession: YP_483553
Protein GI: 86743153
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR01290] nitrogenase cofactor biosynthesis protein NifB
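
The coordinate and length fields in this record agree with one another, and the arithmetic can be sketched as below. This is a minimal illustration, not part of the source record: the function name `cds_lengths` is hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Coordinates taken from the record above.
gene_length, protein_length = cds_lengths(5349796, 5351379)
print(gene_length, protein_length)  # 1584 527
```

Both values match the Gene Length (1584 bp) and Protein Length (527 aa) fields listed above.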


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0277889
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGGCAA CCCCCGTCAG CCTGCCGATG CTGCCCACCG ACCCGGCAGG CGAGATGCCC 
CGGCAGGCCA CCGGGCCGGC GCCCGACACC GCCGCCGGCT GCGCGAGCAA GTCCAGCTGT
GGCACCAGCG CTCCGGTGCG GGACCCGGAG ATCGCGGAGA AGATCGCCAA TCATCCCTGT
TACAGCGCCG AGGCCCACCA GTACTACGCC CGGATGCACG TGGCGGTCGC CCCAGGTTGC
AACATCCAGT GCAACTACTG CAACCGCAAG TTCGACTGCG CCAACGAGAG CCGTCCCGGC
GTCACCAGCA CCCTGCTGTC CCCCGAGGAC GCCCTCGCCA AGGTCAAGCT GGTGGCCAGC
GAGATCAAGC AGATGAGCGT GCTGGGGATC GCCGGCCCCG GCGACCCGCT GGCGAACCCG
AAGCCGACCT TCCGGACGAT GGAACTGGTG GCCCGGGACT GCCCGGACAT CAAGCTCTGC
CTGTCTACCA ACGGGCTGAC CCTGCCGGAC CACGTCGACC GCATCGCCGA ACTGAACGTC
GACCACGTCA CCATCACGAT CAACATGATC GACCCCGAGG TCGGGGAGCG GATCTACCCG
TGGATCGCCT TCCGCGGTAA GCGGTACACC GGCCGGGAGG CGTCCCGGAT CCTCTCCGAG
CGTCAGCTCG AGGGCCTGGC GATGCTCACC GAGCGGAAGA TTCTCTGCAA GGTCAACTCG
GTGATGATCC CCGGGATCAA CGATGACCAC CTCGTCGAGG TCTCCCGGAA GGTCAAGGAG
CTCGGCGCCT TCCTGCACAA CGTGATGCCG CTGGTGTCGG CGCCCGAGCA CGGCACCCAC
TTCGGCCTGA CCGGGCAGCG CGGCCCCACC CCACAGGAGC TCAAGGCGCT GCAGGACCGC
TGCGAGCAGG ACGACGGCGC CGAGATGAAC ATGATGCGGC ACTGCCGGCA GTGCCGCGCC
GACGCCGTCG GCCTGCTCGG TGAGGACCGG GGTGAGGAGT TCACTCCCGA GGCGTTCCGT
GGCCGTGAGA TCGAATACGA CCTCGAAGGC CGCCGGCAGA CGCATAGCGA GATCGAACGG
TGGCGCTCCG AGGTGGCTGC CACCCGGGGA GCGCTGAACA TCTCGACCGG CGCGGTCATT
CCCTCTGGCC CGGGCGCATC CCCCGATGAG GGCACCCCGG CCGGGGCCCG CCCGGGGAAC
GCCCGCCCGG AGAACGTCGT GCTCGTCGCT GTAGCGACCA AGGGCAGCGG CGTGGTGAAC
CAGCACTTCG GTCACGCGAC CGAGTTCTGG ATCTACGAGG GGGGTCCGGG CTGGGCCCGG
CTCGTGCAGA CCCGCGACGT GGACCGCTAC TGCAACGGCC CGTCGGACTG CGACGAGGAC
GCCTCCAAGC TCGACAAGAC GGTCGCGATG CTGTCCGACT GCGCGGCGGT GTTGTGCAGC
AAGATCGGCC TCGGGCCGCG CGAGGCGCTC GAGAATGCCG GGATCGAACC GGTGGAGCTC
TACGACCTGA TCGAGAAGGC GGTGGCCGAG GTCGGCTCCC GCCTTGTCGC ACATCGTGCC
GAAGCGGAGG TTGCCGTCCG ATGA
 
Protein sequence
MKATPVSLPM LPTDPAGEMP RQATGPAPDT AAGCASKSSC GTSAPVRDPE IAEKIANHPC 
YSAEAHQYYA RMHVAVAPGC NIQCNYCNRK FDCANESRPG VTSTLLSPED ALAKVKLVAS
EIKQMSVLGI AGPGDPLANP KPTFRTMELV ARDCPDIKLC LSTNGLTLPD HVDRIAELNV
DHVTITINMI DPEVGERIYP WIAFRGKRYT GREASRILSE RQLEGLAMLT ERKILCKVNS
VMIPGINDDH LVEVSRKVKE LGAFLHNVMP LVSAPEHGTH FGLTGQRGPT PQELKALQDR
CEQDDGAEMN MMRHCRQCRA DAVGLLGEDR GEEFTPEAFR GREIEYDLEG RRQTHSEIER
WRSEVAATRG ALNISTGAVI PSGPGASPDE GTPAGARPGN ARPENVVLVA VATKGSGVVN
QHFGHATEFW IYEGGPGWAR LVQTRDVDRY CNGPSDCDED ASKLDKTVAM LSDCAAVLCS
KIGLGPREAL ENAGIEPVEL YDLIEKAVAE VGSRLVAHRA EAEVAVR
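
The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. The names `CODON_TABLE`, `INITIATORS`, and `translate` are hypothetical helpers, not part of the source record. The sketch assumes that translation table 11 shares the standard amino acid assignments and differs in allowing alternative initiation codons such as GTG, which is read as methionine at the start of a CDS; only a subset of the possible initiators is listed here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI genetic-code order (bases cycled as T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the initiation codons permitted by table 11.
INITIATORS = {"ATG", "GTG", "TTG"}


def translate(dna, has_start=True):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    If has_start is True, a recognised initiator as first codon becomes 'M'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and has_start and codon in INITIATORS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("GTGAAGGCAA CCCCCGTCAG CCTGCCGATG"))            # MKATPVSLPM
print(translate("GAAGCGGAGG TTGCCGTCCG ATGA", has_start=False))  # EAEVAVR
```

The two outputs correspond to the first ten and last seven residues of the protein sequence listed above.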