Gene Francci3_3146 details


Gene Information

Locus tag: Francci3_3146
Symbol:
ID: 3903943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3722948
End bp: 3724768
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 72%
IMG OID: 637880467
Product: NH(3)-dependent NAD(+) synthetase
Protein accession: YP_482232
Protein GI: 86741832
COG category: [H] Coenzyme transport and metabolism
              [R] General function prediction only
COG ID: [COG0171] NAD synthase
        [COG0388] Predicted amidohydrolase
TIGRFAM ID: [TIGR00552] NAD+ synthetase
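
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3722948
END_BP = 3724768
GENE_LENGTH_BP = 1821
PROTEIN_LENGTH_AA = 606


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("protein length matches:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both checks agree with the values listed in the record.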


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.814025
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCAGC TGCGGATCGC CCTCGCCCAG GTGGACACGA CCGTCGGGGA CCTCGCCGGT 
AACGCCGACC TGGTCAGCAC CTGGACGAAA CGGGCCGTCG CCGACGGCGC TCATCTGATC
GCCTTCGGTG AACTCACGTT GACCGGCTAT CCGCCGGAGG ACCTCGTCCT GCGCCGCTCC
TTCGTCGCCG CGAGCCAGCG GGCGCTGGTA GCTCTGGCCC GCCGGCTGGC CGAGGAGGGG
TGTGGCGAGA TCGCCGTCGT CGTCGGCTAC CTCGACGCAT CCGCGCAACC CGCGCCGAAT
GTGGGACGCC CGGCCGGTGA ACCGCAGAAC GCCGCCGCGG TGCTCTGGGG CGGCGAGGTC
GTCGCACGTT ACGCCAAGCA TCACCTGCCG AACTACGGCG TCTTCGACGA GTTCCGCTAC
TTCGTGCCCG GCATGGCCTT TCCGGTGCTG CGGCTGCACG GGATCGACGT GGGCCTGACG
ATCTGCGAGG ACCTGTGGCA GCAGGGCGGG CCGATCACCG TGGCCCGCCG GTCCGGGGTC
GGCCTCGTCC TGTGCATCAA CGGCTCCCCT TACGAGCAGG GGAAGTCCTT CCAGCGCGAC
GCCCTGTGTG CCGAGCGCGC CCGGGAGGCC GCGGCCGCCC TGGCCTACGT CAACCTTGTC
GGCGGGCAGG ACGAGCTGGT CTTCGACGGG GACTCCCTCG TCGTCGACGC CGCGGGGGAA
CTCGTCGCGC GGGCGCCGGT CTTCAGCGAG GCGCTGCTGA TCACCGACCT GGACCTGCCC
GCGGCCGGTT CCGGCCCGGC GCGGGCGGTG TCCGGGGCCG TTGACGCCGG GGCCGTTGAC
GCCGGGGCCG TTGACGCCGA GGACGGCACG ACGATGACGG TGACGCGCAC GGTACTGGCC
GCCGAGCCGC TGGCACCGTT CGCCCCCCTG CCGCCGGTCG TCGCCGACCG CCCCGAGCCC
GCCGGGGAGC TGTACACCGC GCTGGTCACC GCCACCCGCG ACTACATCCG TAAGAACGGC
TTCTCCTCGG TGGCGCTCGG GCTGTCCGGC GGTATCGACT CCGCGCTCGT CGCGACCATC
GCGGTCGACG CGATCGGCGC CGACGCCGTC CACACTGTCG CCCTGCCCTC GGGCTACTCG
TCGGGGCATT CGGTGACCGA CGCCGCCGAA CTCGCCCGCC GCCAGGGCAC GCGGCACGCC
GTGGTGCCCA TCGAGCCGAT CGCCGCCGCG TTCCGCGCCG CCGCCGCCGC CCTCGGCGGG
TTGCACGGCC TCGCGGACGA GAACCTGCAG GCCCGGGTAC GCGGAACGCT GCTCATGGCG
TTGTCGAACC AGCATGGGCA CCTGATCCTC ACCACCGGCA ACAAAAGCGA GCTGGCAACG
GGGTTCTCCA CCCTCTACGG AGACAGCGCC GGCGGCTACG CCCCCATCAA GGACGTCTCG
AAGACCCGCG TCTGGGGGCT GGCGCGCTGG CGCAACGCCG CCGCGGAGAA GCGTGGCGAG
GTGCCACCCA TTCCCGAGGA GATCATCGTC AAGGCGCCGT CCGCCGAGCT CGCCCCGGGT
CAGCTCGACT CCGACCGGCT GCCCGACTAC GGCATCCTCG ATCCCGTCCT GGACGACTAC
GTCAGCCACG ACCGGGGCCG GGCCGAGCTG ATCGCGGCCG GGCACGATCC GGCCGTGGTG
GACAAGGTGA TCCGGCTCGT CGACCTCGCC GAGTACAAGC GCCGCCAGAA CCCGCCCGGG
CCGAAGGTGA CCTCCAAGGC GTTCGGCCGC GACCGCCGAC TGCCGATCAC CTCCCGCTGG
CGCGAGAACC CGCCGGCCTG A
 
Protein sequence
MAQLRIALAQ VDTTVGDLAG NADLVSTWTK RAVADGAHLI AFGELTLTGY PPEDLVLRRS 
FVAASQRALV ALARRLAEEG CGEIAVVVGY LDASAQPAPN VGRPAGEPQN AAAVLWGGEV
VARYAKHHLP NYGVFDEFRY FVPGMAFPVL RLHGIDVGLT ICEDLWQQGG PITVARRSGV
GLVLCINGSP YEQGKSFQRD ALCAERAREA AAALAYVNLV GGQDELVFDG DSLVVDAAGE
LVARAPVFSE ALLITDLDLP AAGSGPARAV SGAVDAGAVD AGAVDAEDGT TMTVTRTVLA
AEPLAPFAPL PPVVADRPEP AGELYTALVT ATRDYIRKNG FSSVALGLSG GIDSALVATI
AVDAIGADAV HTVALPSGYS SGHSVTDAAE LARRQGTRHA VVPIEPIAAA FRAAAAALGG
LHGLADENLQ ARVRGTLLMA LSNQHGHLIL TTGNKSELAT GFSTLYGDSA GGYAPIKDVS
KTRVWGLARW RNAAAEKRGE VPPIPEEIIV KAPSAELAPG QLDSDRLPDY GILDPVLDDY
VSHDRGRAEL IAAGHDPAVV DKVIRLVDLA EYKRRQNPPG PKVTSKAFGR DRRLPITSRW
RENPPA
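
The sequences above are printed in blocks of ten characters, so they need to be joined before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and the helper names are hypothetical. Only the first line of each sequence is embedded here to keep the example short.

```python
from itertools import product

# First line of the gene sequence and of the protein sequence, copied from above.
GENE_FIRST_LINE = (
    "ATGGCCCAGC TGCGGATCGC CCTCGCCCAG GTGGACACGA CCGTCGGGGA CCTCGCCGGT"
)
PROTEIN_FIRST_20 = "MAQLRIALAQ VDTTVGDLAG"

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    dna = clean_sequence(GENE_FIRST_LINE)
    print("translation matches:",
          translate(dna) == clean_sequence(PROTEIN_FIRST_20))
    print("GC fraction of first line:", round(gc_fraction(dna), 2))
```

Pasting the full gene sequence into `GENE_FIRST_LINE` would allow the same comparison against the complete 606-residue protein and the reported GC content.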