Gene Francci3_3375 details


Gene Information

Locus tag: Francci3_3375
Symbol:
ID: 3905957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4004941
End bp: 4006863
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 68%
IMG OID: 637880698
Product: radical SAM family protein
Protein accession: YP_482459
Protein GI: 86742059
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID:
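
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and assumes the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4004941, 4006863)
print(length_bp, protein_length(length_bp))  # prints: 1923 640
```

Under these assumptions the result matches the Gene Length (1923 bp) and Protein Length (640 aa) fields.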


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.131592
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGCAAT CTTTGCCGGG AGACAACCCT GCTCTCCACC GACAGTTCCC GACGGCGTTC 
CCGCACTTGT CGAGATCGCT CGTTCGCGGC CGAATCCGCC CAGGTAAAAG CAATGGGTTT
GGGGGAATTT TGGAACAGGT CTATCCGCAT ACCGGGGTGT GCTACACCTA CGGGAACCAC
GAGCGCGTGT ACCTGATCTA CACCTATGCC TGCAACGCGA CCTGCGCACA TTGCCTGGTC
CAATCGAGTC CACATCGCCG GGAGAGGTTC GACCTGGCCA CCGCGATAGA AATCCTGCGG
ACGGCAGCCA GGTTCGGTCG CAGGTTCCTC GACCTGGGCG GCGGCGAGAT CATGCTTCAT
CCGGAGGACA CCTGTGCGCT CGCCCGCGCC GCCACGGATC TCGGGTACTA CGTCTCCCTC
AACACGAACG GCTTCTGGGC CCGCACCCCC GAACGCGCCC GTACGCTCGT GAGCCGTCTC
CAGCAGGCCG GGGTCCAGGC GATCTTCCCG AGTGCGAGCG CCTTTCACCT GCCGTTCGTG
CCGGTCGAGC GGCTGCGCCA CCTCCGCCGG GCCTGCGCCG ACCTCGGCAT GGTGCACGAG
CTGAGCTGGG TGGCCTCCCA CCTGCCCGAC GTGGACGCCC AGCTGCTGGA CGACCTGGAC
CTGGGCGGTG AGACGGTCTA CCCCAACAGC CTGACCACCG AGGGCAACGA CCCGGAAGTC
ATGGCGGCTC TCACCAGGCA CTACAGCCGG TTCACGCCGG ACCGGCTCCC CGACTGTGGA
AGTCTGCAAC TGGGCGTCAA CCCGCGCGGG CATGTGATAG CCACCTGCGA GATGACCAAT
CTCAACGAGA AGTTCCGCGG CACCGCACTG TTCATCGGAG ATTTCACCCG GACGCCGTTC
GAGCAGCTCC TGGAAGCCGA ACGGGACACG GCCGTGCTGC AGTTCCTCTA TCATAACCCG
CCGGCCGCGC TGCACGATCT GCTGCTCGCC GATCCGCAGG AAGGCGGGGG ATACCGGCAC
CGGTATGCGG ATCGCACCTA TCACAGCGTC ACGGACTACC TCGCCGACCT GCTGCGCGAC
GAGAACGTGC CGGCCGCGGA GCGCGCCATC GCCCGCGCGT CCGATACCCG CGCGTCCCAT
GCCAGCGCGT CCCATGCCAG CACAGCGGTT CCCGGCAGGT TCCACCCGGT GATCTCCCGC
GCGTCCCCAC GTCTAGACCC CGCCGGGAAC CGCCGGCCCG CCGGTAATCC GCGATCCGAT
TCAACGCCTG GTCGAGAGGA ATTCATCGAT GGCGGAGAAA ACGCTAACCT CCAGCTCTTT
TCGTCTCCGT GTGGACCTCC TGGCTCCCCA CATCATGTCG GCCATGGAGG CTTTCGACGC
CGCCACCGAC AAGGTCAGCC TGCCGGCCTC GCTCCTGGAA CTGGTCCGCA CCCGGGCCTC
CCAGATCAAC GGGTGCGCCT TCTGCGTCGG CGCCCACAGC CCGGCCGCCC TCGAGGCCGG
CGCGACCCAG AAGCAGCTGC TGGCCCTGCC CGTCTGGCGC GAGTCGCCGC ACTTCTCGGC
ACAGGAACGT GCGGCGCTGA CCCTCACCGA GGCCATCACC CAGATGGACC GCCGGCCGGT
CACGGACGAG ATGTGGGGCG AGGTCTCGGT GGTCCTCACC GAGGTGGAGC TGGCCGAACT
GGTCTGGGTG ATCGCGGCGA TCAACGTCTG GAACCGCGTC GCCGGTACCG CGCGTCCCTG
GCCAGTGGCC TGACATGCGG ATCGGACTCG TCGGGGCCGG CCGAATCGGC GCCGTGCACG
CCCGCACGCT GGCCGACGAC CCCCGGGTGG ACGAACTGGT CATCACCGAC GTCGACCAGG
AGCAGGCCGC CCGCGGGGCG AGCGCGGCCG GGGCCCGGGT CGCGGCGGAT CTCGAGGCGC
TGA
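
The gene sequence is printed in 10-base blocks separated by spaces, so whitespace has to be stripped before computing anything from it. As a minimal sketch, the GC content field could be recomputed as below; `gc_content` is a hypothetical helper name, and the example call uses only the first two blocks of the sequence rather than the full gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence, copied from the record.
print(gc_content("GTGACGCAAT CTTTGCCGGG"))  # prints: 0.6
```

Pasting the full gene sequence into the same call yields the value to compare against the GC content field.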
 
Protein sequence
MTQSLPGDNP ALHRQFPTAF PHLSRSLVRG RIRPGKSNGF GGILEQVYPH TGVCYTYGNH 
ERVYLIYTYA CNATCAHCLV QSSPHRRERF DLATAIEILR TAARFGRRFL DLGGGEIMLH
PEDTCALARA ATDLGYYVSL NTNGFWARTP ERARTLVSRL QQAGVQAIFP SASAFHLPFV
PVERLRHLRR ACADLGMVHE LSWVASHLPD VDAQLLDDLD LGGETVYPNS LTTEGNDPEV
MAALTRHYSR FTPDRLPDCG SLQLGVNPRG HVIATCEMTN LNEKFRGTAL FIGDFTRTPF
EQLLEAERDT AVLQFLYHNP PAALHDLLLA DPQEGGGYRH RYADRTYHSV TDYLADLLRD
ENVPAAERAI ARASDTRASH ASASHASTAV PGRFHPVISR ASPRLDPAGN RRPAGNPRSD
STPGREEFID GGENANLQLF SSPCGPPGSP HHVGHGGFRR RHRQGQPAGL APGTGPHPGL
PDQRVRLLRR RPQPGRPRGR RDPEAAAGPA RLARVAALLG TGTCGADPHR GHHPDGPPAG
HGRDVGRGLG GPHRGGAGRT GLGDRGDQRL EPRRRYRASL ASGLTCGSDS SGPAESAPCT
PARWPTTPGW TNWSSPTSTR SRPPAGRARP GPGSRRISRR
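
The protein sequence begins with M although the gene sequence begins with GTG. Translation table 11 uses the same codon assignments as the standard genetic code but permits alternative start codons such as GTG, which are read as methionine at the initiation position. The sketch below illustrates this on the first line of the gene sequence. It is a simplified illustration, not the pipeline used to produce the record: it treats whatever codon comes first as the initiator without validating it, and `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        # Simplification: the first codon is assumed to be a valid start codon.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


first_line = "GTGACGCAAT CTTTGCCGGG AGACAACCCT GCTCTCCACC GACAGTTCCC GACGGCGTTC"
print(translate(first_line))  # prints: MTQSLPGDNPALHRQFPTAF
```

The output corresponds to the first two 10-residue blocks of the protein sequence above.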