Gene Francci3_3668 details

Gene Information

Locus tag: Francci3_3668
Symbol:
ID: 3905352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4393666
End bp: 4394823
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 72%
IMG OID: 637880994
Product: hypothetical protein
Protein accession: YP_482749
Protein GI: 86742349
COG category:
COG ID:
TIGRFAM ID:
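
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon (the function names are illustrative and not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4393666, 4394823)
print(length, protein_length_aa(length))  # prints "1158 385"
```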


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGGA TCGGCGCGGT GGATCCCGCG CTCACCGGCA CCGTCGGCTC GGGTTCATAC 
GATCCGGCGG ACGTGACCTT CCTGCTGACC GACCTCAGCG CGGTCGCCCT GGAGCGGCCC
ACCGAGGACC GTGAGGAGGC CATGCAGGCC GGCCGGCACT ACTCCGAGGA CCTGCCGGTC
GAGTACCAAC CCGACGCCGG TTACCTGGAG CTCTACCACC GGGCGCTGGA CCGGTCGGCC
CGCCGGGTCG CGCTCGCCAC CGGCCTCGTG GCGGAGCTCG TACGGGTCAC CAAGCCCGAG
CCGGTCCTGG CCTCGATCGC CCGGGCCGGT ACCCCGGTGG GGATCCTGAT GCGGCGCTGG
TACGGCTGGC GCCACGGGCT CGACACCCCG CACTACGCGA TCTCGGTGAT CAAGGATCGC
GGCGTGGACC TGAACGCGAT CCGCTACCTC ACCTCCCGGC ACGACCGGAA GGTGATCCAG
TTCGTCGACG GCTGGACCGG CAAGGGGGTG ATGACCCGGG TGCTGACCGA CGCGGTGGCC
CGGCTCGGGC TGGACGACAC CCTCGCCGTG CTGGCCGACC CGGCTCGGTG CGTGCCGCTC
TACGGGACCC GCGACGACTT CCTCATCCCG AGCGCCTGCT TGAACTCGAC GGTGAGCGGG
CTGGTCAGCC GGACCGTCCT CAACGCCGAG CACATCGGTC CTGATGACTT CCACGGTGCG
AAGTACTACG CCGAACTCGC GGCGCACGAT CTGTCCCGGC ACTTCGTCGA GACGGTGGCG
GCCCACTTCC CCGAGGTGGC CGACGAGGTC GCCGAGACCT GGCCGCGGCT GTGGGCCGCC
GACCGCAGTC CGACCTGGGC GGGCTGGGCG GCGGTGGAGC GCATCGCGGC CGCGTTCGAC
ATCCCCGACG TGGTCATGGT GAAGCCGGGG GTGGGGGAGA CGACCCGGGT CCTGCTGCGC
CGGGTGCCCT GGCGCATCCT CGTCGCCCCC GACCGGCTCG ACGAGCTGAC GCACGTGCTG
GCCCTGGCCG CCGACCGCGA CGTCGAGGTC CAGGAGCTGG CGGACCTGCC ATTCTCCTGT
GTGGGGCTGG TCCGCCCGGT CGGCGCGGCA CCGGTCTTCC ACACCCCGAA CGCGCGCTGG
CGCCCGGACG TCCTGTGA
 
Protein sequence
MSGIGAVDPA LTGTVGSGSY DPADVTFLLT DLSAVALERP TEDREEAMQA GRHYSEDLPV 
EYQPDAGYLE LYHRALDRSA RRVALATGLV AELVRVTKPE PVLASIARAG TPVGILMRRW
YGWRHGLDTP HYAISVIKDR GVDLNAIRYL TSRHDRKVIQ FVDGWTGKGV MTRVLTDAVA
RLGLDDTLAV LADPARCVPL YGTRDDFLIP SACLNSTVSG LVSRTVLNAE HIGPDDFHGA
KYYAELAAHD LSRHFVETVA AHFPEVADEV AETWPRLWAA DRSPTWAGWA AVERIAAAFD
IPDVVMVKPG VGETTRVLLR RVPWRILVAP DRLDELTHVL ALAADRDVEV QELADLPFSC
VGLVRPVGAA PVFHTPNARW RPDVL
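
The protein sequence can be reproduced from the gene sequence with translation table 11, in which GTG is an accepted start codon and is read as methionine at the first position. The sketch below is an illustration under stated assumptions, not the method used to produce this record: the helper names are hypothetical, the codon table is built from the standard NCBI ordering (table 11 assigns the same amino acids as the standard code and differs in its start codons), and whitespace in the 10-base blocks is stripped before processing.

```python
from itertools import product

# NCBI ordering of codons: first base varies slowest, bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
FIRST_LINE = "GTGAGCGGGA TCGGCGCGGT GGATCCCGCG CTCACCGGCA CCGTCGGCTC GGGTTCATAC"
print(translate(FIRST_LINE))  # prints "MSGIGAVDPALTGTVGSGSY"
```

Passing the full gene sequence block to `translate` and `gc_percent` should allow a comparison against the protein sequence and the GC content reported in the record.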