Gene Francci3_3820 details


Gene Information

Locus tag: Francci3_3820
Symbol:
ID: 3905568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4578408
End bp: 4580264
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 72%
IMG OID: 637881146
Product: alkaline phosphatase
Protein accession: YP_482899
Protein GI: 86742499
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3540] Phosphodiesterase/alkaline phosphatase D
TIGRFAM ID:
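
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4578408, 4580264)
print(length, protein_length(length))  # 1857 618
```

Under these assumptions the values reproduce the listed gene length of 1857 bp and protein length of 618 aa.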


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00587129
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.244243
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGTG ACGCCGCGGC CGGCTCTCCG CGTTCCCGCC GATTCCCGCG GGCGCCCTCG 
CCGTCAGCCG CCACCGGCTC GCCGTCAGCC GCCACCGGTC AGACTGGTCG ACCGGCCGGC
ACCGATCCGT CGGCACGCGC CGATCCGGGC ACCGGTCCTC GCGCCGCCGC CCGGTCCGGC
CACGGGACGA ACCCGGGGGC CGATCCACGT CGCCGTGCCG TCCTGCTCGG CGGCCTCGGG
CTGGCGGGCG CCGCGCTGGG CGGGGCGAGC CTGGCGGCCT GCGGCGGCTC CGGCGGTTCG
ATCCCGGCCC CCGCGCCGAC CCTGCGGGCA CCCACCCCGA TCGCGGGCGT CACCGACGGG
GTCTTCGGGC TCGGGGTCGC CAGCGGCGAC CCGTTGCCGG ACGGCGTCAT CCTCTGGACC
CGGCTCGCGC CGAGGCCGAC CGAGGGCGGC GGCATGCCCA CGCGGGACAT TGAGGTCGAC
TGGCAGATCG CGACGGACGA GGGTTTCCGC GATGTGGTGC GCGCCGGCAC GCAGACCGCG
CAGACGGCGT TCGCGCATTC CGTCCACGTC GACGTCCGCG GACTCGCGCC GGAGCGTGAC
TACTTCTACC GGTTCCGTGC CGGCACCGTG CTCAGTCCAG TCGGCCGGAC CCGGACGGCC
GCGGCACCTG GGAGGGGACC GGAATCGACC GGCGGTGCCC TGACCTTCGC GCTGGCCTCG
TGCCAGGACT TCCAGAACGG GTACTGGCCG GCCCTCGACG GCATCGCCAC CGACGCGCCG
GACCTCGTCG TCCACGTCGG CGACTACATC TACGAGTACG ACCCGAAAAG CAACTACCCG
GACCGGCGGC ACACCACCCC GCAGCGGCCC GGTCTGGACC AACTCCAGAC GCTGGCGGAC
TACCGCAACC GGTACGGCCA GTACAAGTCC GATCCTGCCC TGCAGGCCGC TCATCACGTG
GCCCCCTGGG TCGTCACCTG GGACGATCAC GAGGTCGAGA ACAACTACGC CGGACTGATC
GACGAGGCCG GCGACGCCGG GGAGCAACGG CAGGACCCTG CGGTGTTCGC CCGCCAGCGT
GCCGCCGCCT ACCAGGCGTA CTACGAACAC ATGCCGATCC GCGCGGAGCT GAATCCGGGA
TCGCCCGACA TGCGGATCTA CCGGCGGTTC GTGTTCGGGA ACCTGGTGAC GTTCAACGTC
ATGGACACCC GGCAGTACCG GACCCGGCAG CCCGGCGACT CCCCGCAGGG CATCGGGCTC
GCATCCCTGG GCCGGGACAA CACGGCCGGC ACGATGGCCG GCGCCGCCCA GGAACGCTGG
TTGCGCGACG GCCTGACCAC GTCGCGGACC CGCTGGAACG TCCTCGCCCA GCAGACGATG
ATGGCCCAGC TGAACGGGCA GCTACCCCTC GGCGAGGGAC CACGGCTGGC CAACCTGGAT
CAGAACGACG GGTACGGCCC CTACCGGACC CGGCTGCTGT CGGAGATCCG CGACAGCGGC
GTGCGCAACC CGGTGGTGCT CTCCGGTGAC ATCCACTGCG CCTGGGTAAA CGACCTCCGG
GTCGATTTCG ACCGGCCCGA GACGCCGGTC GTCGCGACGG AGTTCGTCTG CACCTCGATC
AGCTCGGCCT TCTTCCTCGT CAGCGAGGAC TTCATCCGGC AGAACAATGC CCGACTCAAC
CCGCATGTCC GGTATTTTCG CGGTGACCGG CGAGGTTACA CGCGCGTTCG CGTCACCCCG
GACGAATGGC GCGCGGACAT GCGGGTCGTC GCCGACATCG CCCATCGCCA CTCGCCCACG
TCCACGGATG CCACCTGGGT GGTCGAGAAC GGGCGGCCCG GCGCGCGGCC GGCCTGA
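
The gene sequence is displayed in blocks of 10 bases, so it needs to be joined before any analysis. The sketch below strips the block spacing and computes a GC fraction. It assumes GC content means the share of G and C among all bases; the record does not say how its 72% figure was derived. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "GTGACCCGTG ACGCCGCGGC CGGCTCTCCG CGTTCCCGCC GATTCCCGCG GGCGCCCTCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applying the same two functions to all lines of the gene sequence would give the whole-gene value to compare against the listed GC content.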
 
Protein sequence
MTRDAAAGSP RSRRFPRAPS PSAATGSPSA ATGQTGRPAG TDPSARADPG TGPRAAARSG 
HGTNPGADPR RRAVLLGGLG LAGAALGGAS LAACGGSGGS IPAPAPTLRA PTPIAGVTDG
VFGLGVASGD PLPDGVILWT RLAPRPTEGG GMPTRDIEVD WQIATDEGFR DVVRAGTQTA
QTAFAHSVHV DVRGLAPERD YFYRFRAGTV LSPVGRTRTA AAPGRGPEST GGALTFALAS
CQDFQNGYWP ALDGIATDAP DLVVHVGDYI YEYDPKSNYP DRRHTTPQRP GLDQLQTLAD
YRNRYGQYKS DPALQAAHHV APWVVTWDDH EVENNYAGLI DEAGDAGEQR QDPAVFARQR
AAAYQAYYEH MPIRAELNPG SPDMRIYRRF VFGNLVTFNV MDTRQYRTRQ PGDSPQGIGL
ASLGRDNTAG TMAGAAQERW LRDGLTTSRT RWNVLAQQTM MAQLNGQLPL GEGPRLANLD
QNDGYGPYRT RLLSEIRDSG VRNPVVLSGD IHCAWVNDLR VDFDRPETPV VATEFVCTSI
SSAFFLVSED FIRQNNARLN PHVRYFRGDR RGYTRVRVTP DEWRADMRVV ADIAHRHSPT
STDATWVVEN GRPGARPA
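
The protein begins with M even though the gene begins with GTG, which is expected under translation table 11: GTG is one of that table's alternative start codons and is read as methionine at the initiation position. A minimal sketch of this translation rule follows. The codon table is built from the standard genetic code, which table 11 shares for elongation; the function name and structure are illustrative, not the method used to generate this record.

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order (shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a table 11 start codon at position 0 as M."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 15 bases of the gene sequence in this record.
print(translate_cds("GTGACCCGTG ACGCC"))  # MTRDA
```

The output matches the first five residues of the listed protein sequence. Elsewhere in the sequence, GTG is translated as valine, as in the standard code.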