Gene Francci3_3358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3358 
Symbol 
ID3905940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3982682 
End bp3984688 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content70% 
IMG OID637880681 
Producthypothetical protein 
Protein accessionYP_482442 
Protein GI86742042 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.798648 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0325979 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGTGG ACGTTCGTCG TGTCCTGAAC TCTTTTGTTC CCCGGCGGCA GGCCTGGGCC 
CGGCTGGACG AGACCGTCCA GGACACGGGC CTGGCCGATG ACACCACCGG AGTCCTCAAT
CCCGACTTCC TGGCCCTCGG CCTGGGCGCC ACCAACATGA TGGGAATGCT CTGGTCGCTG
GCCCTGGGCC GGCGGGTGGT GGGCGTCGAG CTGCGGGGTG ACCCCTCGCT GGGCGTGCAC
TGGAACATTC GCGAGGACCT TTACCACCAC CTCGGCCTGA TCGACCAGAT GATGCTCGAG
CGCTACGGCG AGGCGGGTAT CCCCCGCCGG GGCGACGGGA GCCTGTTCAT CCTGGCCGAG
TGCTTCTACC GGCCGGACAC GCCCGCCGGC GCCGTCACGG CCGACGAGGT GGTCAGCGGA
TTCCTGGACG CGCTGGTCGG CGAGCCCGCG CGGATCGGCG GCCGGATCTT CCACACCGAG
TTCATCGACG ACCGTTGGAA GGACGGGAAG CCGCACCGCA CGGTGACCGT GCTGACGCCG
CCGGAGCCCC CGCGCCGGCC CGACCCCACC CGGGTGGGGC GCAGCACCCT CGAGGCGCTG
GAGGGGCCCT CCACGTTCCA GAGCGCGGCC TCCGAGGTGA TGGTGCTGAT GCGCCGCTAC
CTCGAAGGGG TCGAGCGGAT GGACCTGGCC CGGGGCGTCA CGCCGCGGGT GCGGTTGTTC
CTGTCCCACC GGGTGGTCAC CACCGGGACC GGGAGCGACG CGGGCTATCT GAAGTGGCTG
CGCCGCGAGG AGGGATTCGG GGACGCCTCC GGCGGTCGCA AGTCCATCCG CATCGAGCAG
GTAAGGGAGC TGGACTACAA CGGCAGGTTC CACCGCGTGC GGGTGCCGGG CAGCAAGGTG
ATCGACATCG GGATTCCCGA GCTGTTCATG ATCGCGCAGG GTTTCAACAG CACCGACGCC
GACCGCCTAG GCTTCAGGCA GGAGGACGTG CTGGTGGACC ACCACGACGG CCGCGGGCCG
GTCGTGGCGC AGGCCGACTA CCTCGCCGGC CTGCTGGAGG TGCTGGTCGA CGGCCGGCTG
CGGCGCAGGA TCGCCTCCGA CTTCGACAAG GAGGGCAACG AGTACTGGGT GCGGCAGATC
GCGGTGGGGC ATGAGGACGA CGCCGAGGTC GGCTGGATCC TGGTGCAGGT TCCCGACTAC
AAGACGTTCG ACCCGATCCT GTCCGGCCTG GTGCCGCCCG GTACCTACCG CAAGTCCAAG
CAGTACCGGG CCGGCGTGCA GCACCTGATG CGGGAGTTCT ACCTGGACCA GGTCTCGCAG
ATCTGCGAGA TGCCGGTGTC GGAACTGGAG AGGATCCAGA TGCCGTACGG TCCGAAGCTG
TTCAGCCTGG TCGAGCGGGC CGGGGTGGAC GCCCAGGTCG CGGCCAACGG GGTCGTCGCC
GGGGACACCT TCGGCAACGG CCATTTCCTC ACCAGCGGCG GCGCCATCAC GGGCATGATC
GGTCACGGGC ACCGCGTCAA GCTGTACTGG GAGGCCCGGG ACGCCGGAGT GCCCCACGAG
CAGGCCATAC GCGGCCTGGC CGACGGCATC AAGCAGGACA CCGACGACTG GTTCGCGGTG
AGCGCGCAGG AGTTCAGCAC CGCCCTGCCG ATCAACTTCG GCTCCGAACG GATCGCCACG
ATCGAGGCGG CCGGCGGACA CCGGTCGTCC GCCCGGGCGA CCACGATCGA CGCCACCCGC
CGTCACCGGC ACACCCTGGT GCCGCTCGAC CCGTCGGACT GGCGCCGGTT GCTGGTGCGC
AGCGGGCGGA TGCACGCCCT GGCCCTGCCC CCGATCCCGA TGACCCACCC GGTCGTGCGC
GGCGGTGGTG GGCTGCCTGA TCCTGCCGAC GCCCAGCAGG ACGGTGCTAC GGCCGGGATG
GGGGCCGGGA TGGGCGGTTG CATGGTCCAG CCGGGCGGTG CCATGGGCGG GATGGACGGT
GCGATGGCCG AGGTGGCAGC TCAGTGA
 
Protein sequence
MGVDVRRVLN SFVPRRQAWA RLDETVQDTG LADDTTGVLN PDFLALGLGA TNMMGMLWSL 
ALGRRVVGVE LRGDPSLGVH WNIREDLYHH LGLIDQMMLE RYGEAGIPRR GDGSLFILAE
CFYRPDTPAG AVTADEVVSG FLDALVGEPA RIGGRIFHTE FIDDRWKDGK PHRTVTVLTP
PEPPRRPDPT RVGRSTLEAL EGPSTFQSAA SEVMVLMRRY LEGVERMDLA RGVTPRVRLF
LSHRVVTTGT GSDAGYLKWL RREEGFGDAS GGRKSIRIEQ VRELDYNGRF HRVRVPGSKV
IDIGIPELFM IAQGFNSTDA DRLGFRQEDV LVDHHDGRGP VVAQADYLAG LLEVLVDGRL
RRRIASDFDK EGNEYWVRQI AVGHEDDAEV GWILVQVPDY KTFDPILSGL VPPGTYRKSK
QYRAGVQHLM REFYLDQVSQ ICEMPVSELE RIQMPYGPKL FSLVERAGVD AQVAANGVVA
GDTFGNGHFL TSGGAITGMI GHGHRVKLYW EARDAGVPHE QAIRGLADGI KQDTDDWFAV
SAQEFSTALP INFGSERIAT IEAAGGHRSS ARATTIDATR RHRHTLVPLD PSDWRRLLVR
SGRMHALALP PIPMTHPVVR GGGGLPDPAD AQQDGATAGM GAGMGGCMVQ PGGAMGGMDG
AMAEVAAQ