Gene Francci3_3356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3356 
Symbol 
ID3905938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3980137 
End bp3981288 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content72% 
IMG OID637880679 
Productepoxide hydrolase-like 
Protein accessionYP_482440 
Protein GI86742040 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.21914 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.116129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGATCA CACCGTCCCG GATCCGGGTG CCCGAGGACG TCCTGACCGG ACTGCGGCAG 
CGGATCGCGC GGGTGCGCTG GCCGCAACCG GCCCCCGGCC CGGCCTGGTC CCAGGGCACC
GACCTGGCGT TCCTCCAGGG AATGCTCGCC GACTGGGCCA CCTTCGACTG GCGTGCGGCC
GAGGAGAGAA TCAACGGCGG GTACGACCAG TTCGTCGCCG AGGTGTCCGG GCTGCGGGTG
CACTACGTCC ATCATCGGGT GCCGGGGGCC GACGGTCCGC CGGTCATCCT GACCCACGGC
TGGCCGAGCA GCTTCGTGGA GATGCTCCCG CTCGTCGACC GCCTGCGCGA CCCCGCGGCG
TACGGCATCG ACGCTCCCGC GCGGGATGTC GTCGCCGTGT CCCTGCCCGG GTATCCCTTC
TCCGAGCGTC CGGCCGGGGA ACACACGCTG CGGGACACCG CGCGTGTCTG GCACGACCTG
ATGACCGGGC TCGGCTATCC CCGGTACCTG GCCGCGGGCA GCGACTTCGG TTCCGGGGTC
AGCACCTTCC TGGCACTCGA CCATCCCGAC ACGGTGGCCG GGCTGTACCT GACCGATCTC
GAACTCGACC CCGTCCTGGA CCCGGCCGTC GACCCGACAC CGCTGTCCCC GGCCGAGCGT
GCGTATCTGG ACGCCGGCGA ACGGTGGTCG CTGACCGAAG GCGGCTATCA CGCGATCGCG
TCCACCCGGC CCCAGACCCT CGCGTACGGG CTGACCGACT CGCCCGCCGG GCTGGCCGCA
TGGCTGCTGG AGAAGTGGCG TGCCTGGTCG GACTGCGCCG AAGGCCGGGT GCCCCGAGTG
TCCCGGGAGT TCCTGCTGAC CACGCTCACG CTCTACTGGG CCACCGGTTG CGTCGGGAGC
ACGCTGCGCG ACTTCCACGA CAACCGCCAG GTCCAGGAGG GCATGACGGT CGGTGACCGG
GTCCTCGCCC CCACCGCGTT CGGGCGTTTC GGGAACGGCC TGGATGACCT TCGCCCGCCC
CCGCCCGAGT TCGTCGGACG GCTGTGCCGC GTGGTGCGCT CCACGGTGCA CGACGAGGGC
GGGCACTTCC CCGCGGTGGA GGTTCCCGAC CGGCTCGCCG CCGACATGCT CGCCTTCTTC
GCCGAATGCT GA
 
Protein sequence
MLITPSRIRV PEDVLTGLRQ RIARVRWPQP APGPAWSQGT DLAFLQGMLA DWATFDWRAA 
EERINGGYDQ FVAEVSGLRV HYVHHRVPGA DGPPVILTHG WPSSFVEMLP LVDRLRDPAA
YGIDAPARDV VAVSLPGYPF SERPAGEHTL RDTARVWHDL MTGLGYPRYL AAGSDFGSGV
STFLALDHPD TVAGLYLTDL ELDPVLDPAV DPTPLSPAER AYLDAGERWS LTEGGYHAIA
STRPQTLAYG LTDSPAGLAA WLLEKWRAWS DCAEGRVPRV SREFLLTTLT LYWATGCVGS
TLRDFHDNRQ VQEGMTVGDR VLAPTAFGRF GNGLDDLRPP PPEFVGRLCR VVRSTVHDEG
GHFPAVEVPD RLAADMLAFF AEC