Gene Francci3_4350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4350 
Symbol 
ID3907322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5192761 
End bp5194992 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content68% 
IMG OID637881681 
Productprotein of unknown function DUF224, cysteine-rich region 
Protein accessionYP_483425 
Protein GI86743025 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGATG CTGTGAGAAT TGCGATCGGT GGAGCGATCA CGCTGATTGC TCTCGCGGTT 
GCCGGGCGTC GGGTTTTCTG GCTGTTCCGG CTGATCGGGT CCGGCCAGCC GGCCGAGGGC
CGGCTCGACG ACCTGCCGAC CCGCGTCTGG ACGGAGATCA GCGAGGTCGG GGGGCAGCGC
AAGCTGCTGA AGTGGTCGGT GCCCGGGGCG GCCCACTTCT TCACCTTCTG GGGCTTCACG
ATCCTGCTGC TGACCGTCAT CGAGGCCTAC GGCGGCCTGT TCGACGACGA CTTCCACATC
CCGGGATTCG GGCACTGGGC GGCCATCGGC TTCCTCGAGG ACTTCTTCGC CGTGGCGGTC
CTCGCCGGCC TGGTCGCCTT CACGATCATC CGGTTGAAGA ACGCCCCGGC GCGGCTGGAG
CGCAAGTCGC GATTCTACGG CTCGCACACC GGGCCGGCCT GGGTCATCCT CGGCCTGATC
ACCGCCGTTA TCGTGACGCT GCTGCTCTAC CGTGGTGCCC AGTACAACAC CGGCAACCTC
CCGCAGGGCC AGACGAAGTG GGCCTTCGCG TCCTACGCTG TGAGCCGCAT CTTCTCCGGT
CTCTCCCACG ATGTGAACGA GGGCATCGAG ACGGCCTTCC TGCTGCTCAA CATCGCGGTG
ATCATGGGCT TCTCGGTGCT GGTGGTCTAC TCGAAGCACC TGCACATCGG CCTGGCCCCG
ATCAACGTCG TGCTCAAGCG CCAGCCGGTG GCCCTCGGCC CGCTGGCCAC CACTCCCGAC
ATCGAGAAGC TGATGGAGGA GGACGAGCCG GTCGTCGGCG TCGGCAAGGT CGAGGACTTC
TCCTGGAAGG CCATGCTCGA CTTCGCCACC TGCACCGAGT GCGGGCGGTG CCAGAGCCAG
TGCCCGGCGT GGAACACAGG CAAGCCGCTG TCGCCGAAGT TGTTGATCAT GGATCTGCGT
GACCACCTCT TCGCCAAGGC GCCCTACCTG CTGTCCACCG AAGGCGCGGC GGAGGGTGAG
GAGGCGCCGA AGGCGGTCAC CGGGATCGCC GAGGACGCCT CCGCCTCCCA CACCGTGCAC
CACGTGCCCG AGTCCGGCTT CGGCCGGGTG CCGCAGCCCG GTCAGCCGCA GGTCGACCGG
CCGCTGGTCG GCACCGAGGA GGAGGGCGGG GTCATCGATC CCGACGTCCT GTGGTCGTGC
ACCAACTGCG GCGCGTGCGT CGAACAGTGT CCGGTGGACA TCGAGCATGT CGACCACATC
GTCGACATGC GTCGCTACCA GGTGATGATC GAGTCGGCGT TCCCGTCCGA GGCCGGCGTC
ATGCTGCGCA ACCTTGAGAA CAACGGCAAC CCGTGGGGCG TCTCGCCGCG GACCCGCACC
GAGTGGACCG AGGGCCTGCC CTTCGAGGTG CGTGTCCTCG GCGACGGTGA GCAGATCCCC
GACGACGTCG AGTACCTGTA CTGGGTCGGC TGCGCCGGGG CGATCGAGGA CCGCGCCAAA
AAGGTCGCGC GGGCCTTCGC GGAGCTCCTG CACACTGCCG GGGTCGAGTT CGCGATCCTC
GGTACGAACG AGTCCTGCAC CGGGGACCCG GCGCGCCGGC TCGGCAACGA GTACCTGTAC
CAGGAGATGG CCAAGGCGAA CATCGAGCTG CTGAACGCCA CGGGCGTCAA GAAGATCGTT
GCCACCTGCC CGCACTGCTT CAACAGCCTC GCCCGGGAAT ACTCGGCCCT CGGCGGTCAG
TACGAGGTCG TCCACCACAC CCAGCTGCTC GGCAAACTGG TCGAGGAGCG CAAGCTGGTA
CCGGTGACCC GGGTGGAGTC GTCGGTCACC TACCACGACC CCTGCTTCCT CGGGCGGCAC
AACAAGGTCT ACACCCCGCC GCGGGAGATC CTGGCGGCGA TTCCGGGCAT CCGGGGCCAG
GAGATGCACC GCTGCAAGGA CCGCGGCTTC TGCTGCGGTG CCGGCGGCGC GCGGATGTGG
ATGGAGGAGA AGATCGGTAA GCGGGTCAAC GTGGACCGGA TGGAGGAGGC CCTCGGCCTC
GACCCGGACG TCGTGTCCAC CGCCTGCCCG TTCTGCATCG TGATGCTCAC CGATGCCGTC
ACCGAGAAGA AGCTGAGCGG CGAGGCGAAG GACGGTGTCG AGGTACTCGA CGTCTCCCAG
CTCCTGGCCC GTTCGCTGGC GCCGTCGGCG CCAGCGGCTC CGACCGAGGC GTCCGCCGAG
CCTGTCGGCT AA
 
Protein sequence
MEDAVRIAIG GAITLIALAV AGRRVFWLFR LIGSGQPAEG RLDDLPTRVW TEISEVGGQR 
KLLKWSVPGA AHFFTFWGFT ILLLTVIEAY GGLFDDDFHI PGFGHWAAIG FLEDFFAVAV
LAGLVAFTII RLKNAPARLE RKSRFYGSHT GPAWVILGLI TAVIVTLLLY RGAQYNTGNL
PQGQTKWAFA SYAVSRIFSG LSHDVNEGIE TAFLLLNIAV IMGFSVLVVY SKHLHIGLAP
INVVLKRQPV ALGPLATTPD IEKLMEEDEP VVGVGKVEDF SWKAMLDFAT CTECGRCQSQ
CPAWNTGKPL SPKLLIMDLR DHLFAKAPYL LSTEGAAEGE EAPKAVTGIA EDASASHTVH
HVPESGFGRV PQPGQPQVDR PLVGTEEEGG VIDPDVLWSC TNCGACVEQC PVDIEHVDHI
VDMRRYQVMI ESAFPSEAGV MLRNLENNGN PWGVSPRTRT EWTEGLPFEV RVLGDGEQIP
DDVEYLYWVG CAGAIEDRAK KVARAFAELL HTAGVEFAIL GTNESCTGDP ARRLGNEYLY
QEMAKANIEL LNATGVKKIV ATCPHCFNSL AREYSALGGQ YEVVHHTQLL GKLVEERKLV
PVTRVESSVT YHDPCFLGRH NKVYTPPREI LAAIPGIRGQ EMHRCKDRGF CCGAGGARMW
MEEKIGKRVN VDRMEEALGL DPDVVSTACP FCIVMLTDAV TEKKLSGEAK DGVEVLDVSQ
LLARSLAPSA PAAPTEASAE PVG