Gene Francci3_1863 details


Gene Information

Locus tag: Francci3_1863
Symbol:
ID: 3906138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2197228
End bp: 2198688
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 63%
IMG OID: 637879201
Product: radical SAM family protein
Protein accession: YP_480968
Protein GI: 86740568
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID:
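
The start, end, gene length, and protein length fields above are arithmetically related. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2197228, 2198688)
print(length)                     # 1461
print(protein_length_aa(length))  # 486
```

Both results agree with the Gene Length and Protein Length fields listed in the record.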


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.300057
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.908691
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCTTC TCCTTATCGC CATGCCATGG CAGGGACTTG ACACACCGTC CAGTGCGCTG 
GGAGTACTCG GGCCTTCCGT CCGCGAACAG GCGCCCGGCT GGTCCGTGGA CGAGTTGTAC
GGAAACTTCC GGTGGGCTGA GCACCTGATG TGCGCTAGCG GCGGAGCCAT CGGCGTGGCG
GACTACGGCA AGGTGGCGGT CCAGGTCTTT CAGGGCGTCG GCGACTGGGT CTTCGCTCCC
GCGCTGTATG ACGTCGCGAG CTACCGTGTC GACGAGTACG CGCAGCTTCT CGACGCGCAG
GGTGTCGACG CGGAGGTTCC CGTCGAGATG CAGCGACACT CCCGGACGTT CATCCGGCAG
CTGGCGGCCG AGATCGCCGC CGATCCACCG GACATCGTCG GATTCACCAG CACTTTCATG
CAGAACGTTC CGTCACTCGC GCTCGCGAGG GAGATCAAGA GAGTCGCGCC GGGCGTCCTC
ACGGTGATCG GCGGCAGCAA CTGCGACGGG CCGCAGGGGC CGGCCCTGCA CCGGAACTTC
GATCAGCTCG ACTTCGTGAT CAGTGGTGAA GGCGAACGGT CGCTGCCCGC GCTGCTGAGG
TGTGTCGCCG CAGGTGCGAG TGTCGCCGAC ATACCCGGCC TCAGCTGGCG GTCCGACGGG
ATGACGGTCA CGAACCCACC CGCCGAGTCG TCCGTGCCCT TCGGCGTGGT GCCAGCCCCC
GACTACGACG GCTACTTCCA GGCGCTCGAG AATTCGTCGC TCGGCCCCGG CATCCGGCCG
ATGGCGGTTC TCGAGACATC CCGTGGCTGT TGGTGGGGCG AAGTTCACCA GTGCACCTTC
TGCGGCCTGA ACGGATCGAA CATTAACTTT CGGAGCAAGG CTCCCGAACG CATCGCGCAC
GAGGTCCGGG ACCTGGCGTC GAGGCACCGC GTTCTCGACG TGGTGATGGT CGACAACATT
CTCGACATGG GCTACATCGA TAAGGTGATG CCGGAGCTGG CGGCCCTCGA CTGTGATCTG
AGGATTCACT ACGAGATCAA GTCGAACATG ACCCGCGAGC AGCTGGGCCG CCTGAGGGAC
GCGAACGTGC TCTTCGTCCA GCCGGGCATC GAGAGCCTGA GCAGCCACGT GCTTCGGCTG
ATGGAAAAGG GCGTGAGTTC GGCGCACAAC GTGCGTATGC TGCGAGATGG CATGGATCTC
GGCCTGAGCG TCACCTGGAA CATCCTGTAC GGATTCCCTG GGGAGACCGA CGAAGATTAT
CAGAGCCTGC TGAGGAAAAT GGCATCACTG GAACACCTCC AGCCGCCAAC CGGCGCGTGG
CGCATTGCAC TGGAGAGGTT CAGTCCCTAT TTCGATGATC CTTCCATTGG ATTCATGTTC
CGCACGCCGG CGCGCTTCTA TGAACTTATC TATAATGTTC CAAAAGGCGA GTTGTATGAT
CTCGTCGTAA GAGTTCGGTG A
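
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the sequence is used to keep the example short, so the printed fraction describes that line only, not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence shown above.
first_line = "ATGAGGCTTC TCCTTATCGC CATGCCATGG CAGGGACTTG ACACACCGTC CAGTGCGCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of the first line: {gc_fraction(seq):.2f}")
```

Passing the full sequence block to `clean_sequence` instead of one line would give a value to compare against the reported GC content.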
 
Protein sequence
MRLLLIAMPW QGLDTPSSAL GVLGPSVREQ APGWSVDELY GNFRWAEHLM CASGGAIGVA 
DYGKVAVQVF QGVGDWVFAP ALYDVASYRV DEYAQLLDAQ GVDAEVPVEM QRHSRTFIRQ
LAAEIAADPP DIVGFTSTFM QNVPSLALAR EIKRVAPGVL TVIGGSNCDG PQGPALHRNF
DQLDFVISGE GERSLPALLR CVAAGASVAD IPGLSWRSDG MTVTNPPAES SVPFGVVPAP
DYDGYFQALE NSSLGPGIRP MAVLETSRGC WWGEVHQCTF CGLNGSNINF RSKAPERIAH
EVRDLASRHR VLDVVMVDNI LDMGYIDKVM PELAALDCDL RIHYEIKSNM TREQLGRLRD
ANVLFVQPGI ESLSSHVLRL MEKGVSSAHN VRMLRDGMDL GLSVTWNILY GFPGETDEDY
QSLLRKMASL EHLQPPTGAW RIALERFSPY FDDPSIGFMF RTPARFYELI YNVPKGELYD
LVVRVR