Gene Francci3_3845 details

Gene Information

Locus tag: Francci3_3845
Symbol:
ID: 3905593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4605835
End bp: 4607430
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 63%
IMG OID: 637881171
Product: 4Fe-4S ferredoxin, iron-sulfur binding
Protein accession: YP_482924
Protein GI: 86742524
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
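
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 4605835
end_bp = 4607430

# Assumption: both coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
total_codons = gene_length_bp // 3
protein_length_aa = total_codons - 1

print(gene_length_bp, protein_length_aa)  # 1596 531
```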


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.130355
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCGGA CGATAATCGT GGGATCGGGT CCCGCTGCCG CGGGAGCGGC TCTGGCACTG 
GCCGAGGACC CGACCCAGGA AATCCTGGTC GTGGACATCG GGACGCGGCT CGAGGCCGAA
GCTGCCAACA TCGTCGACGA GCTCGGTCGC TCAGAGTCGG ACGAGTGGCG TTCCGACCAG
GTTAAGATCA TCAGTCGTCA GCCGGTGCCC ACCTCGCGCC GGGCCCTTCC GGAGAAGCGG
GTCTACGGAT CCGACTTCCC CTTCCGGGAC GAGGGGCAGC TCCGCGGGGT GAGTGCGATG
GTGGGTGCGA ACCGGGCCGT AGTCTCCAGC GCGTACGGCG GTTTCAGTAA TGTGTGGGGC
TCGCAGTTGA TGCCCTTCAG CAAGGCGACT TTCGGCCATT GGCCCTTCAC CCTCCAGGAG
TTGGAGCCGC ATTACCGCCG AATCCTTGAC CATGTCCCGA TGTCCGGCGC GATCGACGAT
CTGGCCGAGT TGTTTCCATT GTTCCGGGAG CCACGACCGC TGCCGACCCT GGCACCGCGG
ACCCGCCAGG TTATTGGTGA CTACGGTAGG CATCGGGAGC GGCTGCGCCG CGACGGCATC
ACCGTCGGAA AGGCTCGCCT GGCCCTCCGG GGTGGCGAGT GTGTCCGCTG CGGTCTCTGT
ATGACAGGCT GTCCTTACCG GCTGATCTAT TCCTCCGCAC AGACATTCGA CCAGTTGATC
AGCGAAGGTC GGGTCAAGTA CCTGCCGGGA CTCCTGGTAT TCCGGGTTGA GGAGTCCGAG
TCCGGGGCCC GTCTGTTCGC CCGGGAGACC GTGAGTGGCC AGACCCAGGA GCTAGAAGGA
GATCGGATCT TTCTCGCCGC GGGCGCCCTG GGCAGCACGA GGATTGCTCT GAACTCGTTG
CGTTGGTTCG ATGAGCGGAT CGAGCTCGCC GAATCTGCGC AGTTCACGGT TCCCATGCTG
TCGCGGCGCC GGACACCTGA TCCGCGTGCG GACGGGGAGA TGACTCTCAA CCAGTTCAAT
ATGATTGTTG TGCTGGATGA CGTCGGTCTC GATGTATCAC AGATCCACTT TTATCCCTAT
AATCCCGCGA TCACCGAGGC GCTTCCGCGG GTCCTGCAGT CCGGTTGGGG CTCCGTACTT
GGTGACGAGG TTCTTCGGCG GCTTACGGTC GGTCTGGGTT ATCTCCCGTC GTGGGCCTCT
CCGCGGCTAC GGGTTCAGGC TGTCCCGTCT GCTGGTGAAG CGGATCTCCC GGAAATTCAC
CTGAGCGGTA CCCGTGACGG CGTCCTCGGA AATGGGCTGC TCCGGAAGGT GGTGCGACGG
CTCGTCACCG CCGCCCCGTT CCTCGATCTG TGGCCGGTGG TGCCGATGGT TTCCTGTTCG
GCGCCGGGTA AGAGCTACCA CTGGGGTGGG AGCTTCCCGC ATTCCTCCGG GGTTTCGGGA
ACCCGGCATA CAAGTGACAT GCTTGGCCGG GTAGGGGGGT GGCGGCGTAT TCACCTGGTT
GACGCATCGG TGTTCCCTTC GGTGGCGGCG ACGACATTCA CTTTTACGAT CATGGCAAAT
GCACACCGAA TAGGATCAGA GGTGCGAGGG TTGTGA
 
Protein sequence
MSRTIIVGSG PAAAGAALAL AEDPTQEILV VDIGTRLEAE AANIVDELGR SESDEWRSDQ 
VKIISRQPVP TSRRALPEKR VYGSDFPFRD EGQLRGVSAM VGANRAVVSS AYGGFSNVWG
SQLMPFSKAT FGHWPFTLQE LEPHYRRILD HVPMSGAIDD LAELFPLFRE PRPLPTLAPR
TRQVIGDYGR HRERLRRDGI TVGKARLALR GGECVRCGLC MTGCPYRLIY SSAQTFDQLI
SEGRVKYLPG LLVFRVEESE SGARLFARET VSGQTQELEG DRIFLAAGAL GSTRIALNSL
RWFDERIELA ESAQFTVPML SRRRTPDPRA DGEMTLNQFN MIVVLDDVGL DVSQIHFYPY
NPAITEALPR VLQSGWGSVL GDEVLRRLTV GLGYLPSWAS PRLRVQAVPS AGEADLPEIH
LSGTRDGVLG NGLLRKVVRR LVTAAPFLDL WPVVPMVSCS APGKSYHWGG SFPHSSGVSG
TRHTSDMLGR VGGWRRIHLV DASVFPSVAA TTFTFTIMAN AHRIGSEVRG L
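
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a minimal, dependency-free illustration rather than the method used to produce this record. It builds the codon table from the NCBI amino-acid string, which is identical for translation tables 1 and 11 (table 11 differs only in its permitted start codons), and translates the first 30 bases and the last line of the gene sequence. The last line starts in frame because the 26 lines before it hold 1560 bases. The helper name `translate` is hypothetical.

```python
# Amino acids in NCBI codon-table order, with bases ordered T, C, A, G
# at each codon position. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its amino acid.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and the final line of the gene sequence in this record.
first_block = "ATGTCCCGGA CGATAATCGT GGGATCGGGT"
last_block = "GCACACCGAA TAGGATCAGA GGTGCGAGGG TTGTGA"

print(translate(first_block))  # MSRTIIVGSG
print(translate(last_block))   # AHRIGSEVRGL*
```

Both outputs agree with the start and end of the protein sequence listed above, with the trailing `*` corresponding to the TGA stop codon.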