Gene Francci3_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1954 
Symbol 
ID3904316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2295483 
End bp2296895 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content67% 
IMG OID637879291 
Producthypothetical protein 
Protein accessionYP_481058 
Protein GI86740658 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0082937 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCGC CCCGCAGCGA CACGATGCTC GCCCGACTAC TTCATGATCG GAACTGGAGT 
TGCGACGACT TCCGCCGTGT CTTCGACACC ACCGCCCGCA CACTCAGCCG GGAAAACAAC
GATCTTCGGG TGCCGGGGCC ACTCAGCGAC CGGCAGGCGA AACGGTGGAT CGCCGGCCAG
GTCGCCACCA GGCCCTATCC GGCGGCATGC CGGGTGTTGG AGGAGATGTT TCACGTTTCC
GTGGAAGATT TGCTGACCCG GAATCCCACT CGCTACCGTG AGACACAGGC CGTAGCCCCG
CCGCCCGTGC TACCACGTCC CGTCATGGTT CCCGCTCAGG CGGAGCCCGG CAGCGGGCCG
CCCCGCGAGG AGGTGAGTCC GTCGCGCCGC CGTGACCTAC TCGGCACCGG CATCCTTCTT
GCCGCCAGCA CCGCCACGAC TGGCGCCACC AGCCGCGCCG CGCAGATCTC CCGCGCCATC
GCCGCGTCCA CCCCGGACCC CCTCAGTGTC GCCCAGCTCC ACCAGGGCAT CCAACGCCTC
ACCCGGATCT ACTCCACGAC ACCGCACGCG GATCTACTCG GACCCGTCGA ACAGGCGTGG
GACGATGCGG AAGCCCGGTT GGAAACCAGA GTCACCGGCA CCGACCGTCG CGACCTCGAA
CTCCTCGCCG GCCAATACGC CTTCTACCGT GGTCGACTCG CCTTCGACAT GGGCGATGAC
GACGGCGCGT TGACGTTCTT CGTCCTCGCC GCTCAACACG CCCAGGCAGC CGGTAACCCA
CTACTTGCCG GATCGGTTGC CGCCATGCGC TCCGCCGTGG CCTTCTTCGC CCGCGAGTTC
GAGACCGCTG CGGACATCGC CGCCCAGGCC CAGCCGCAGG CGCACCCCTA TATCGTCCCG
CTCCTCGCCT GCGCACAGGC CCGCGCCTAC GCCATCACCA GCCGCGAAGA CGAAGCACTA
ACCGCCTTGC GGACCATGAA TGATCATGTC TGGACCGGCA ACCGCCTGCC AGGACCTTCA
CCGATTGACG AAGAGTCCAG CGAAGCTTTC AATGCGGTGG TACTCGGCTA TCTCGGCAAG
GGTGACGAAG CCGAGCCACA CGCCCGCGCA TCGTTGGCGA TGCTCAGCGG AACCGGACGG
TACGTCAGCA CCGCCGGCAC CCATCTCGCA CTCGCACGGG CCTTCATCCA CCGCACCCAC
CCGGACCCCG AACAGGCAGC CACCGCAGCA AGCGACGCCC TCATGATCGT TGACGGCAAG
GAAGTGACCA ACGGCCCGAC CTTCAACCGG GCCGCCGGTA TTTGGCGCAC CCTCGCCCGC
AACGAGGAGT GGGCACGGCT CGAACCCGTC CGCGACCTGG GTGAACAGGT CTCATCTGAG
CGCCGTGCGC TCCCAGCAGG CCCAACCATC TAG
 
Protein sequence
MAPPRSDTML ARLLHDRNWS CDDFRRVFDT TARTLSRENN DLRVPGPLSD RQAKRWIAGQ 
VATRPYPAAC RVLEEMFHVS VEDLLTRNPT RYRETQAVAP PPVLPRPVMV PAQAEPGSGP
PREEVSPSRR RDLLGTGILL AASTATTGAT SRAAQISRAI AASTPDPLSV AQLHQGIQRL
TRIYSTTPHA DLLGPVEQAW DDAEARLETR VTGTDRRDLE LLAGQYAFYR GRLAFDMGDD
DGALTFFVLA AQHAQAAGNP LLAGSVAAMR SAVAFFAREF ETAADIAAQA QPQAHPYIVP
LLACAQARAY AITSREDEAL TALRTMNDHV WTGNRLPGPS PIDEESSEAF NAVVLGYLGK
GDEAEPHARA SLAMLSGTGR YVSTAGTHLA LARAFIHRTH PDPEQAATAA SDALMIVDGK
EVTNGPTFNR AAGIWRTLAR NEEWARLEPV RDLGEQVSSE RRALPAGPTI