Gene Francci3_3067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3067 
Symbol 
ID3904268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3636164 
End bp3637339 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content75% 
IMG OID637880388 
Producthypothetical protein 
Protein accessionYP_482153 
Protein GI86741753 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.293034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCATG ATGGTGCGGG GGGCCCGGGA TCGCCGGGAC GTCACCGTTC TCCCTGGGGG 
CGTTCAGGCC GGATCCTGCC GATCCGGTGG GGACGCATCA TCGCTGTGGT GGCTGTGTTG
GTCGTCATCG TTGCGGGTCT GGTGGTTGTG CGTGGGCGGA TGGCCGGCGA CCATCGTGCG
GGTTCCTCCG GAGGCAGGCC GGGTGCGCCG GTCGCAGCGC CGGCACCCAC GCCCGGTGAC
GAGGACGGGC CGGCGGACGT GACGGCGGCC CCGGTCACTG CCCGGCCGGG TGCCGGTCCG
GCCCTTTCCA CCGCGTCCGC GGCCTCCCCG AGCACCGGCA TCGGGTCGGT TTCGCCACGG
GTTGCCCCGA CCGCCGCGAG GCCGCGGACC GGACCGCCGC CCGGGTCACC GGCCGTCCCG
GTCGCTGCGC GGCCCGCCGG CCCGGCCCTA AGCCTGAGCT CGACCAGGGT CGACCTGGGT
GACGTCGACT CCGTCTGGCG GCTTGACCTG CGTGGCGAGG GCACGGCGCC GGTCGACGTC
GTGATCGGCG CCAGCCCGTC CTGGCTAACC GTGGTGCCCG GCCGGTCGCG GATCGATCCG
GGGGCCGTCG TCCCCCTGGT GATCACTCTT GATCGGGCGG CGGCACCGTC CGGACCGATC
GATGTGACCG TTCCAGTCAG GCCCCGCAAG GGTGATGGCG GGGGCGACGT GCGGGTCACC
GCCAGGGTCG ACGGCTCGCC ACAGGTGGTG TCGATGGTCG CGGCGCCGAC GACGCTGTAC
CCGTCCGGCT GCGCGCCGGC CGCGAGGGTG ACCGCGTCCC CGGTGACCGC GTCCCCGGTG
ACCGCGTCCC CGGTGACCGC GTCCCCGGTG ACCGCGTCCC CGGTGACCGC GTCCCCGGTG
ACCGCGTCCC CGGTGACGGT GTCCACGGTG ACGGTGTCCG TCGTGGACGC GACCGGCATC
TTCGCCGCGG AGCTCGTGGT GAACCTCCCC GACGGAGGCA GGGCTAGTGT GTCGCTGGTC
CTTGACCGGG CGACGGGTAA CCGGTCGACG TGGTCCGGCC CGGTGGGCCC GGCCCGTGTG
GCCGGGGCGA TCACCTACAC GGCCACGGTG ACCGATCTCG ACGGGCGGCG CTCCCGGGCA
CCGGGATCCC TCACCGTTCT GCCCTGCCCC TCCTGA
 
Protein sequence
MTHDGAGGPG SPGRHRSPWG RSGRILPIRW GRIIAVVAVL VVIVAGLVVV RGRMAGDHRA 
GSSGGRPGAP VAAPAPTPGD EDGPADVTAA PVTARPGAGP ALSTASAASP STGIGSVSPR
VAPTAARPRT GPPPGSPAVP VAARPAGPAL SLSSTRVDLG DVDSVWRLDL RGEGTAPVDV
VIGASPSWLT VVPGRSRIDP GAVVPLVITL DRAAAPSGPI DVTVPVRPRK GDGGGDVRVT
ARVDGSPQVV SMVAAPTTLY PSGCAPAARV TASPVTASPV TASPVTASPV TASPVTASPV
TASPVTVSTV TVSVVDATGI FAAELVVNLP DGGRASVSLV LDRATGNRST WSGPVGPARV
AGAITYTATV TDLDGRRSRA PGSLTVLPCP S