Gene Francci3_0863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0863 
Symbol 
ID3903846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1006240 
End bp1008525 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content72% 
IMG OID637878196 
Productassimilatory nitrate reductase (NADH) alpha subunit apoprotein 
Protein accessionYP_479976 
Protein GI86739576 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.410026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGTGC CGGCCGCGCC CCTTGCCGAG GCTGCGACAC ACTGCCCGTA CTGCGCCCTG 
CAGTGCGGAA TGATGCTGCG CGCCACCCCG GAGAGCGCCG CCAAGGAAGG CACCGGAAAA
GACGACGGCC CGGAACCCGC AGGTGGAAGG TCTCCGGGGC CGGTTACCGT GCTCGCCCGC
GAGTTCCCCA CCAACCGGGG CCGGATGTGC CAGAAGGGGT GGACCTCCGC CGAGCTGCTG
ACCGCCGCGG ACCGGCTCAC CACCCCGCTG ATGCGGCCGC GACGCGACGC CCCCCTCGAG
CCGGTGAGCT GGGAACGGGC ACTCGACCGC ATCACCGAGG GGATCATCCG CCTCCAGCGC
GAACACGGCC CGGACGCGGT CGGCGTCTTC GGCGGCGGCG GGTTGACCAA CGAGAAGGCC
TACGCCCTAG GCAAGTTCGC TCGGGTGGCG CTGCGCAGCT CCGCGATCGA CTACAACGGC
CGGTTCTGCA TGTCGTCGGC GGCCGCCGCG GGCATCCGCT CGTTCGGGCT GGACCGGGGC
CTGCCCTTCC CACTGGAGGA CATCGCGCAC GCCGGGGCCG TCATCATGGT CGGCAGCAAC
GCGGCCGAGA CCATGCCCCC GTTCATGCAG TATCTGACCC GCCAGCGGGA GAACGGCGGG
GCGCTCGTGG TCGTCGACCC GCGGCTGACG GCCACCGCCC GCGCCGCGGC CCTACATCTG
CAGATCACCC CGGGAACTGA CCTGGCCCTG GCCAACGGGC TGGCCTACAT CGCCCTAACG
GAGGGGTACG CCAACCGCGA CTTCCTCGCC GCTCGGACGA CGGGCCTCGC CGAGCTGCGA
GCAGTACTCG CCTCCTACTG GCCCGAACGG GTGGAGCGCA TCACCGGAGT GCCGGTGGCA
GACCAGTACG CCGCCGTGGA CCTGCTCGCC CAGGCCGAAC GGGCGATGGT GCTCAGCGCC
CGCGGTGCCG AACAACACAG CAAGGGCACC GACACCGTCA CGGCGCTCAT CAACCTCGCC
CTCGTACTGG GGCTGCCCGG CACCCCCGGG TCGGGCTACG GCTGTCTCAC CGGCCAGGGC
AACGGGCAGG GCGGCCGGGA ACACGGGCAG AAGGCCGACC AGCTTCCCGG CTATCGCAAG
ATCATCGATC CGGTGGCCCG GGAGCACATC GGTCGGATCT GGGGCGTCGA CCCGACGACC
ATCCCCGGTC CGGGCCGCAG CGCCTACGAG ATGCTGGACG CGCTCGGCAC GCCGCAGGGC
CCGAGGGCGC TCCTCATCCT CGGCAGCAAC ATCGCCGTCT CCGCCCCCCG GGCCGGTCGG
ATCACCTCGC GGCTGGCCGC CTTGGACCTG CTCGTCGTCG CCGACTTCGT GCTCAGCGAG
ACGGCGGCCA TGGCCGACAT CGTCCTGCCG ACCGCCCAGT GGGCGGAGGA GGAGGGGACG
ATGACCAACC TGGAGGGTCG GGTACTGCGG CGGGAACGGC TGCGGCCTCC TCCGGCCGGG
GTACGCACCG ACCTGGAGAT CATCGCGGCG CTCGCCGCCC GACTCGGCCA CGCCGAGCGC
TTTCCCGGCG AACCACGGGC GGTCTTCGAC GAGCTGCGCC GGGCCAGCGC CGGAGGGATC
GCCGACTACG CGGGCATCAG CTACGAGCGG ATCACCGCAT CGGACGGGGT GTTCTGGCCC
TGCCCGGACG AGACGCACCC GGGCACCCCC CGGATGTTCC TGGATCGCTT CGCCACCCCG
GACGGACGGG CCCGGCTCGT GCCGGTCGAG CACCGTCCGG TGGCGGAGGA CATCGACCCC
GAGTACCCCT ACTACCTGAC CACCGGGCGG GTGCTGGCTC ATTACCAGAG CGGTGCGCAG
ACCCGCCGGA TCGGACCCCT CGTTGACGCC GCGCCGGAAC CCTTCGTCGA GATCCATCCC
GACCTCGCCG AACGGCTCGG GATCGCTGAG GGGGCGCCGG TGCGGGTCAC GAGCCGACGA
GGCACCTGCG AGGTGCCGGC GAGGCTGACC GACACCCTCC GATTCGACAC CGTCTTCCTG
CCCTTCCACT GGGCCGGGGC CGGACGGGCG AACTCCCTGA CCAACGACGC GCTCGACCCG
ACGTCCCGGA TGCCCGAGTT CAAGGTGTGC GCGGTCGCGG TCGAGCCGAT CGACGACCCG
GACGACAACC CGGACGACGA TCGGCATGCC ACCGGGTCCG TCCCCACCGC GACGGTCGGG
GCTGCCGAAC CCGTCCGAGC CATGTTGACG GGCCGCAGCC AACCGAGCGG AAGGCACCAA
GGGTGA
 
Protein sequence
MPVPAAPLAE AATHCPYCAL QCGMMLRATP ESAAKEGTGK DDGPEPAGGR SPGPVTVLAR 
EFPTNRGRMC QKGWTSAELL TAADRLTTPL MRPRRDAPLE PVSWERALDR ITEGIIRLQR
EHGPDAVGVF GGGGLTNEKA YALGKFARVA LRSSAIDYNG RFCMSSAAAA GIRSFGLDRG
LPFPLEDIAH AGAVIMVGSN AAETMPPFMQ YLTRQRENGG ALVVVDPRLT ATARAAALHL
QITPGTDLAL ANGLAYIALT EGYANRDFLA ARTTGLAELR AVLASYWPER VERITGVPVA
DQYAAVDLLA QAERAMVLSA RGAEQHSKGT DTVTALINLA LVLGLPGTPG SGYGCLTGQG
NGQGGREHGQ KADQLPGYRK IIDPVAREHI GRIWGVDPTT IPGPGRSAYE MLDALGTPQG
PRALLILGSN IAVSAPRAGR ITSRLAALDL LVVADFVLSE TAAMADIVLP TAQWAEEEGT
MTNLEGRVLR RERLRPPPAG VRTDLEIIAA LAARLGHAER FPGEPRAVFD ELRRASAGGI
ADYAGISYER ITASDGVFWP CPDETHPGTP RMFLDRFATP DGRARLVPVE HRPVAEDIDP
EYPYYLTTGR VLAHYQSGAQ TRRIGPLVDA APEPFVEIHP DLAERLGIAE GAPVRVTSRR
GTCEVPARLT DTLRFDTVFL PFHWAGAGRA NSLTNDALDP TSRMPEFKVC AVAVEPIDDP
DDNPDDDRHA TGSVPTATVG AAEPVRAMLT GRSQPSGRHQ G