Gene Francci3_3559 details


Gene Information

Locus tag: Francci3_3559
Symbol:
ID: 3904498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4256678
End bp: 4257838
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 71%
IMG OID: 637880880
Product: phosphoesterase, RecJ-like
Protein accession: YP_482640
Protein GI: 86742240
COG category: [R] General function prediction only
COG ID: [COG0618] Exopolyphosphatase-related proteins
TIGRFAM ID:
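
The coordinate and length fields above are mutually consistent, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4256678, 4257838)
print(length_bp)                  # 1161
print(protein_length(length_bp))  # 386
```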


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0624488
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.616422
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGGCA CGAGACAGCT GCGATGGGAA CTGACGATGA GCGATGCGAC CGATCTCGCG 
GGCCCGCCGG CGGCGTCGGT CAACCGCTCC CGCCCGGACG GTGCCGGTCC GACCGGTGAT
TTCCAGGCCG CGGTGACCGC ACTGACCGGC GCGATCGACG CCGGTGACGA CATCCTGCTG
ATCGCCCACG TGAACCCCGA CGGGGACAGT CTCGGATCGG CGTTGGCGCT GGGGCTCGCG
CTGCGGTCGC TGGGCGGAAG GCCGCGGGTG TCCTTCGACG CGGATCCCTT CGTCGTTCCC
CGGGTGCTTC GTTTCCTCGC CGGTCAGGAT CTGTTGGTGG CCCCCGAGGC CGTCACCGGC
CGGCCGGGTC TCGTGGTGAC CCTGGACGCC GGCAGCCGGT CCCGGCTGGG CAGGCTCGCC
AGCACCACGG CCGGTTGCCG GGTCCTGGTC GTCGACCACC ATGCCTCCAA CACCCGGTTC
GGCGATCTCA ATCTGGTCGA CCCCGAGGCG GCCTCGACCA GCGCCATGGT CATCGACCTC
GTCGATGCCC TGGGCGTGCG GCTCGACCGG GACATCGCGA CAGCGATCTA CACCGGTCTG
GTAACCGACA CGGGATCGTT CCGTTTCGCC GCGACGACAC CGGCGGTGCA CCAGCTGGCC
GCCCGGCTGG TGGCCACGGG CATCCGTCCC GACCTGATCA GCCGTGCCCT GTGGGACACC
CACCGCTTCG GCTATCTCAA GCTACTCGGG GAGGTTCTCG GCCGCGTCCG GCTGGAGCCC
GAGTACGACC TCGTCTGGTC CTGGTGCAGC CAGGCCGATC TGCGCGCCGC GGGACTGGAG
TACGACGAGA TCGAGGGGCT CATCGACACG GTGCGCACCG TGTCGGAGGC AGAGGTCGCC
CTTGTCTGCA AGCAGGATGG TGACGTCTGG AAGGTCTCGG TCCGTTCCAA GGGGGACGTT
GACGTCGGCG CGGTCTGTAC GGCTCTCGGT GGTGGCGGCC ATCGCTTCGC CGCCGGATTC
TCCTGCGGGA CGACGCTCGA CGAGTTGATG GACGTGCTTC GGGACGCGCT GGCGCGGGCT
CCGCGCCTGG GGGGCCCGGG GACTCCGGAG AATCCGCGCG GCCCGGGTGG CCCGAGGAAC
GCGGTCCGGA GCCCGCGGTG A
 
Protein sequence
MTGTRQLRWE LTMSDATDLA GPPAASVNRS RPDGAGPTGD FQAAVTALTG AIDAGDDILL 
IAHVNPDGDS LGSALALGLA LRSLGGRPRV SFDADPFVVP RVLRFLAGQD LLVAPEAVTG
RPGLVVTLDA GSRSRLGRLA STTAGCRVLV VDHHASNTRF GDLNLVDPEA ASTSAMVIDL
VDALGVRLDR DIATAIYTGL VTDTGSFRFA ATTPAVHQLA ARLVATGIRP DLISRALWDT
HRFGYLKLLG EVLGRVRLEP EYDLVWSWCS QADLRAAGLE YDEIEGLIDT VRTVSEAEVA
LVCKQDGDVW KVSVRSKGDV DVGAVCTALG GGGHRFAAGF SCGTTLDELM DVLRDALARA
PRLGGPGTPE NPRGPGGPRN AVRSPR
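
The gene and protein sequences above can be cross-checked against each other. The Python sketch below is one possible way to do this, not the method used to produce this record: it builds the codon table shared by the standard code and NCBI translation table 11, treats a recognised initiator codon (here GTG) as methionine, and computes a GC fraction. All function names are hypothetical. Only the first 30 and last 21 nucleotides of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino acid assignments
# are the same for the standard code and for bacterial table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str, starts_at_initiator: bool = False) -> str:
    """Translate an in-frame fragment; '*' marks a stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("fragment length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if starts_at_initiator:
        if codons[0] not in START_CODONS:
            raise ValueError("first codon is not a table 11 initiator")
        residues[0] = "M"  # an initiator codon is read as methionine
    return "".join(residues)


first_30 = "GTGACAGGCA CGAGACAGCT GCGATGGGAA"
last_21 = "GCGGTCCGGA GCCCGCGGTG A"

print(translate(first_30, starts_at_initiator=True))  # MTGTRQLRWE
print(translate(last_21))                             # AVRSPR*
print(round(gc_fraction(first_30), 2))
```

The two printed translations match the start (MTGTRQLRWE) and end (AVRSPR) of the protein sequence listed above, with the trailing `*` corresponding to the TGA stop codon. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content field.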