Gene Francci3_0525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0525 
Symbol 
ID3905436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp610351 
End bp612060 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content70% 
IMG OID637877854 
Productferredoxin--nitrite reductase 
Protein accessionYP_479638 
Protein GI86739238 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0155] Sulfite reductase, beta subunit (hemoprotein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.873826 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.355394 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCTC CTGCTCGTCC TGCCCGGTCC GCGCGCCCGT CCCGACCCAA GGGAGCGGGC 
CAGTGGGCGT TGGGATATTC CGAGCCGCTC AACGCGAACG AGCGGATGAA GAAGGACCAA
GGCGGCCTGG AGGTGCGGGA CCGCATCCTC AACCTCTACC CGCGCACCGG TTTCGACGGG
ATCGATCCCC AGGACCTGCG GGGCCGGTTC CGGTGGTGGG GTCTGTACAC CCAGCGCCGG
CCGGGGATCA GTGGCGGGCG GACGGCGATC CTCGAACCCG AGGAACTGGA CGACTCCTAC
TTCATGCAGC GCATCCGGCT CGACGGCGGC CGGATGACCT CGGATCAGCT CCGGGTGATC
GCCGACGTCT CCACCTGCTA CGGCCGGGAC GTCGCTGATG TCACCGACCG GCAGAACATC
CAACTCCACT GGATCCGGAT CGAGGATGTG CCGGCGATCT GGGCGGCGCT GGAGAACGCC
GGCATGACCA CCGCCGAGGC CTGTGGCGAC ACCCCCCGGG TGATCCTCGG TTGCCCGCTC
GCCGGGGTCG ACGGTGACGA GATCATCGAC GGGAGCGAGG CGATCGACGA GGTCGTCCGC
CGGTTCGTCG GCGACCCCAC GCTGGCCAAC CTGCCCCGTA AGTTCAAAAG CGCCATCTCC
GGCTGCGCCG ACCACTGCAC GTTGCACGAG ATCAACGACA TCGCGTTCGT CGGGGTCGTC
CATCCGGAGC TGGGAGCCGG CTACGACCTG TGGGTCGGTG GCGGTCTGTC GACGAACCCA
CGGCTCGGCG TGCGGCTGGG CGCGTTCGTG CCGCCGGCGC GGGTTGCCGA GGTCTGGCAC
GGCGTCGTAT CGCTGTTCCG AGACTACGGC TACCGACGGC TGCGCACCCG CGCCCGGTTG
AAGTTCCTGG TCGCGGACAT GGGCGCCGAA TGGGTGCGCG CGACTCTGGA GAAGGAGTAC
CTGGCTTCGG CGCTGCCCGA TGGCCCGCCG CCGGCCCCGC CGCGCAGCGA GGGGCGCGAC
CACATCGGGG TGCACCGCCA GCGCGACGGC CGCAACGCCG TGGGGTTCGC ACCGCGGGCG
GGTCGGCTCA GCGGGACCAC GCTGAGCGCG GTGGCGGACC TCGCCGACCG GTTCGGGCAG
GGCCGCATCC GCGCCACGAC GACGCAGAAG CTCGTCATCC TCGATGTCGC CGACGCGGAC
GTCAACGCGC TGGAGGCCGA ACTCGGTGCG CTCGACCTGG TGGTCCGCCC GAGCGTGTTC
CGGCGCGGCA CGATGGCCTG CACCGGTATC GAGTTCTGCA AACTGGCGAT CGTCGAGACC
AAGGGTCGGG CCCGCGACCT CATCGACGAG CTCGAACGCC GGCTGCCCGA CTTCGACGAG
CCGATCGGGA TCAACGTCAA CGGCTGCCCC AACGCCTGCG CCCGCTTTCA GGTCGCCGAC
ATCGGGCTCA AGGGCTCCCT GGTACCGGAC GACGCCACCG GTGAGATGGT CGAGGGCTTC
CAGGTCCACC TCGGCGGGCA CCTGGGCACC AGGTCGCGGC TGGGCCGCAA GTCGCGTGGT
CTGAAGGTGA CCGCCGACGG CCTCGTCGAC TACGTCGTCG CGGTGCTGGA GACCTACCGG
GCGGACCGGA CCGCGGGGGA GAGCTTCGCG GACTGGGCGG AGCGGGCCGA CGAGGCCCAA
CTGACGGCGC TCGGTGCCGC AGATGGCTAG
 
Protein sequence
MTSPARPARS ARPSRPKGAG QWALGYSEPL NANERMKKDQ GGLEVRDRIL NLYPRTGFDG 
IDPQDLRGRF RWWGLYTQRR PGISGGRTAI LEPEELDDSY FMQRIRLDGG RMTSDQLRVI
ADVSTCYGRD VADVTDRQNI QLHWIRIEDV PAIWAALENA GMTTAEACGD TPRVILGCPL
AGVDGDEIID GSEAIDEVVR RFVGDPTLAN LPRKFKSAIS GCADHCTLHE INDIAFVGVV
HPELGAGYDL WVGGGLSTNP RLGVRLGAFV PPARVAEVWH GVVSLFRDYG YRRLRTRARL
KFLVADMGAE WVRATLEKEY LASALPDGPP PAPPRSEGRD HIGVHRQRDG RNAVGFAPRA
GRLSGTTLSA VADLADRFGQ GRIRATTTQK LVILDVADAD VNALEAELGA LDLVVRPSVF
RRGTMACTGI EFCKLAIVET KGRARDLIDE LERRLPDFDE PIGINVNGCP NACARFQVAD
IGLKGSLVPD DATGEMVEGF QVHLGGHLGT RSRLGRKSRG LKVTADGLVD YVVAVLETYR
ADRTAGESFA DWAERADEAQ LTALGAADG