Gene Francci3_0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0789 
Symbol 
ID3905725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp918877 
End bp919995 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content75% 
IMG OID637878122 
Producthypothetical protein 
Protein accessionYP_479902 
Protein GI86739502 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.236873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGAGG TCCCGGAGGT CGCAGACCGA CGCTCTGTCC TGGTTGCCAC CGGCCTCGCG 
GCGGTGGGCT CCGTGAGTCT GCGCGGCGAC GCGTCCGGCC TGCACGAAGC CTGCATGGCA
GCCCTGACCG CGACGGTCAG GGCCTACGAG ACGTCTGATC TGATCACTAC CACGTCCGCC
CTGGTCTCGC TGGAGCGCAC GACACAGACC GTGCCCCGGG AGCACCGGCT GCGCGGCCGG
CCGGCGCTGG AGTGGATGCG GCTGCACGCG GCGGTCACGA CGGTGGCTGC TGCCACCGCC
TACGACCGCG GCAAACACTC CGACGCCGCT GCGCGGGCCG ACCGGGCCGC GGCGCTCGCC
CGGGCGGCCG GGGACGGGCC ACTCGCCGCC CGCGCCCTGG CCCTGCGCGC CCGGGTGGTC
CGCCCACACA GCCCGGCGGT GTCCTTGCAG ATCGCCGGGG CGGCGGCGCG GATCGCGGGG
CACAGCTCCA CCCGGGCCCT CATCGCCGGG AAGGTCGTCA CCAGCGCCTG TGCGGCGACC
GGCGACCGGG CAGGCGTCCG GGACGCCGTC GCCAGAGCCT GGCAGACGAT GGAACGGCTC
GACGACACCG CCCACGGCTC GCCGGGTTTC TCGCTCGACA CTTACTCCCC GGCGGATCTG
GCCTTGGCCT GCGCCGAGGC CCTGACCACC GTCGGCGCGG CCGACGAGGC CACGCCTCAC
CTGGAACGCG CCTCGACGTT GATCACCGGC AGCGGCCAGA CGGGCATGGT GGTCTCGGTG
CGGATGGCGC AGGCCCGGGC CGCGCTGGCC CGGGAGGTCC CGGATCGCGA CGAGGCAGCC
CAGCACGCCG CTCAGGCGGT GGCGCTGTCC GCCGGGCGGC CGGCCGAATG GGTGGCCCGG
CTGGTCCGGG ACGTGTCGGA TCTGGCCGAG CGCCGCACCG GCCACGGTCT CGACGATCTC
ATGGACGCCA CCTCGACCTG GTTGCATCAG GGGTCGGAGG AAAGCCAACG GGTGACGCGC
ACGCAACCGG CGCGGTTCGG CGTTCCGGCG CCGGCCGGCG TCATGGTGCA GCGGGCTGGC
GGACCACCGA CTGTGCCTGC GCTCCGGCAC GTACCGTGA
 
Protein sequence
MSEVPEVADR RSVLVATGLA AVGSVSLRGD ASGLHEACMA ALTATVRAYE TSDLITTTSA 
LVSLERTTQT VPREHRLRGR PALEWMRLHA AVTTVAAATA YDRGKHSDAA ARADRAAALA
RAAGDGPLAA RALALRARVV RPHSPAVSLQ IAGAAARIAG HSSTRALIAG KVVTSACAAT
GDRAGVRDAV ARAWQTMERL DDTAHGSPGF SLDTYSPADL ALACAEALTT VGAADEATPH
LERASTLITG SGQTGMVVSV RMAQARAALA REVPDRDEAA QHAAQAVALS AGRPAEWVAR
LVRDVSDLAE RRTGHGLDDL MDATSTWLHQ GSEESQRVTR TQPARFGVPA PAGVMVQRAG
GPPTVPALRH VP