Gene Francci3_3000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3000 
Symbol 
ID3905497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3554118 
End bp3555185 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content71% 
IMG OID637880320 
Producthypothetical protein 
Protein accessionYP_482086 
Protein GI86741686 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.911183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.462352 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC TGCTGGCGAT CGTCGCCACC GTCGTCCTGC TGGCGCTGAA CGCGCTGTTC 
GTCGGGGCCG AGTTCGCGCT GATCTCGGCC CGGCGGTCGA ACTTGGAGCC GATGGCCGAG
GCGGGCTCAC GACTGGCAAA GATCACCCTT GGGGCGATGG AGAACGTCTC GTTGATGCTC
GCCGGGGCGC AGCTTGGCGT CACCGTCTGC ACCCTGGGCC TCGGCGCTCT CGGGGAACCG
GCGGTCGCGC ATCTGCTCCG CGGGCCGTTC GAGGCGGCCG GGATGCCCTC CGCCCTGCTG
CACCCGGTGG CCTTCGCCAT CGCGCTGGGC CTGGTGACCT TCCTGCACGT GGTCGTCGGC
GAGATGGTCC CGAAGAACAT CGCGCTGGCG ATGCCGGAGC GTGCGGTCCT GCTGCTCGCC
CCGCTCCTGG TCGGAGTCGT CCGAGGCGGC AAGCCGGTCA TCGCGCTGCT CAACACGATC
GCGGCCGTGT CGTTGCGGCT CGTCGGTGTC GAACCGAAGG ACGAGATCGC GAGCGTCTTC
ACCCGCGACG AGGTCGCCGG ACTCATCGAG GAGTCGCATC GGGAGGGTCT GCTCGCCGAG
GACGAGCACG ACCTGCTGAC CGGGGCGCTG TCGTTCGAGG AACGCACCGC GCGTGCCGTC
CTGCTCCCGC CGGACGGGCT GGTCACCGTG TCCCCGGCGG TCACTCCCCG GCAGGTCGAG
CAGCTCGCCG CGCGCACCGG ATTCACCCGC TTCCCGGTGC GGGACCCCGA GGGCGGGCTG
ATCGGCTACC TGCACCTCAA GGACGTGCTG GAGACCCGGC CGGAGCGGCG GTCCAGCTCG
GTCGCGGCCA AGTGGATCCG GCCGCTGGCC CGGGTCGGCG CCGACGACAA CCTGCGGACG
GCACTGGCGA CAATGCAGCA CTCCGGCGCG CATCTGGCCA GGGTCACCGA CGCCGACGGT
ACGGTCCTCG GGCTCGTCGC GCTGGAGGAC ATCCTGGAGG AGCTGGTCGG GGAGATCCGC
GACGACGCCG TTCGCACGGC CGTGGTCCCC ACCCACCAGC CCGCCTGA
 
Protein sequence
MTDLLAIVAT VVLLALNALF VGAEFALISA RRSNLEPMAE AGSRLAKITL GAMENVSLML 
AGAQLGVTVC TLGLGALGEP AVAHLLRGPF EAAGMPSALL HPVAFAIALG LVTFLHVVVG
EMVPKNIALA MPERAVLLLA PLLVGVVRGG KPVIALLNTI AAVSLRLVGV EPKDEIASVF
TRDEVAGLIE ESHREGLLAE DEHDLLTGAL SFEERTARAV LLPPDGLVTV SPAVTPRQVE
QLAARTGFTR FPVRDPEGGL IGYLHLKDVL ETRPERRSSS VAAKWIRPLA RVGADDNLRT
ALATMQHSGA HLARVTDADG TVLGLVALED ILEELVGEIR DDAVRTAVVP THQPA