Gene Francci3_4359 details


Gene Information

Locus tag: Francci3_4359
Symbol:
ID: 3907331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5206433
End bp: 5207530
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 71%
IMG OID: 637881690
Product: hypothetical protein
Protein accession: YP_483434
Protein GI: 86743034
COG category:
COG ID:
TIGRFAM ID:
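
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(5206433, 5207530)
print(gene_len, protein_len)  # 1098 365
```

Under these assumptions the result agrees with the listed gene length (1098 bp) and protein length (365 aa).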


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0687672
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.636465
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGTGG ATACCCCTCG GGGAACACCG GCCGGAGTCC GGCCGAGCGA GGTCACGTAC 
GATCATCTGA TCATTCCGCC GGACTGGCGT CGGCCGAAGG AGTCGGCGCC GGGGACGCTC
AACGCTCTGT TCCCCTCCGA GGGAGGCCCC CGCGCCGAGC CTGCCGCGGC CGGCTCGACG
TCGCCGATCA ACCCGACAAC CGGTCCGTTG ACGGGTCCGT TGACGGGCCC GCTGCTGGTC
GGCACCACGG CGCGTGCCTT CGCGAAACTC GACGACCTGA TCCGTCTCGG TGCCGAGGAC
CGTGCCGTCG TCGCGGCTCG CCGCGACGAG GCGGAGCGGG CGCTGCGCGC GATCTTCCCG
CCTCGTTGCG CGCTGCCGCT GGTCGGGGTC GCCACGATCG GCTCGGCGGG TCGGGACACC
ATGATCCGTC CGCTCGACGA GGTGGACATC TTCGCGGTCT TCTCGGCGGC CAACAGCGCC
TGGAAGCGTT TCCGCTGGGA CTCCCGCGAC CTGCTGCTCT GCGTGCGCAA CGCCATCGGC
GGAGATCGAA TCCAGACGAT CGGTAGCCGC GGCCAGGCGC TGCGCATCGT CTACGACGCC
CCTCCGGACG TCCATCTGGT GCCGGCCTTC GATCATCCGC GTGCCGGGTA CGTCATACCC
GATCGGGTCG GTGGCTGGCT GCCGACCCGG CCGGAGCGGC ACACGAGCTG GACGGCGGAC
CTCGGCCCCC GGGTGATCTC GGCGGTCCGC CTGCTCAAGG CGTGGAACCG GGTGCGCGGC
AGCCACCTGC GCTCCTTCCA CATCGAGGCG CTCGCCGGGC AGGTCCTCGC CGGCCGCGGG
CTCAACACCC GGCAGGGGCT CGCCGAGGTC TTCCGGCATC TGGACGATGT CGGCCTCACC
GTGGCCGACC CGGCCGACGT CGGCGGGGAC CTGTCGTCGT ATCTACGCGC CGAGGACATC
GAGGCGTTGG CCGAGAGTGT CCGGCTCGCC CGGATCTACT CGGCGAAGGC GGTGGACGCG
GAACAGGCCG GCGATCACGA GGAGGCCGTG AATCTGTGGG GTTCCCTCTT CGGGCCGGAG
TTCCCCACGT TCGGCTGA
 
Protein sequence
MSVDTPRGTP AGVRPSEVTY DHLIIPPDWR RPKESAPGTL NALFPSEGGP RAEPAAAGST 
SPINPTTGPL TGPLTGPLLV GTTARAFAKL DDLIRLGAED RAVVAARRDE AERALRAIFP
PRCALPLVGV ATIGSAGRDT MIRPLDEVDI FAVFSAANSA WKRFRWDSRD LLLCVRNAIG
GDRIQTIGSR GQALRIVYDA PPDVHLVPAF DHPRAGYVIP DRVGGWLPTR PERHTSWTAD
LGPRVISAVR LLKAWNRVRG SHLRSFHIEA LAGQVLAGRG LNTRQGLAEV FRHLDDVGLT
VADPADVGGD LSSYLRAEDI EALAESVRLA RIYSAKAVDA EQAGDHEEAV NLWGSLFGPE
FPTFG
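
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, plus a GC-fraction helper. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons). The names `translate` and `gc_content` are hypothetical, and whether the record's GC content figure was computed over the gene sequence in exactly this way is an assumption.

```python
# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the sequence listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGAGTGTGG ATACCCCTCG GGGAACACCG"))  # MSVDTPRGTP
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, since the final codon of the gene is TGA, a stop codon.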