Gene Francci3_4384 details


Gene Information

Locus tag: Francci3_4384
Symbol:
ID: 3907358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5234882
End bp: 5236642
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 69%
IMG OID: 637881715
Product: metallophosphoesterase
Protein accession: YP_483459
Protein GI: 86743059
COG category: [R] General function prediction only
COG ID: [COG1409] Predicted phosphohydrolases
TIGRFAM ID:
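
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the reported Gene Length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the terminal stop codon does not contribute a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(5234882, 5236642)
print(length)                  # 1761
print(protein_length(length))  # 586
```

Both results agree with the Gene Length (1761 bp) and Protein Length (586 aa) fields of this record.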


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.508778
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.535785
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGGAG TAGCCCCGCG GCTGCTCCGG CCATCGCAGT CTCGGCCGCG TAGCCGGGCT 
GGTCGCTCCA GAGCTGGAAC ACGTTCGTTC TCTCACACAT CGCTCGGTCG CACCGCTCGA
TCGGCGACCG GTAACAGAAT ATGGGAAAAT GTCAGCCACC GGGTAACCAC CTTGCCGGCC
GGGGCATCCG GCGAGATCGG GTCCGGTTGC CGGCGCAGGC ACGAGCACAG GCACAACTTC
ATCGCCCCGA GGACGATTCG GACTGCTGGT TCGGCGAACG GGAGAGCTCC GCTGGTGACT
GGTAGTCCCG ACGTCGCGCT CGGCGTTCAC CTGACCTTCG GCTCCGATCC CGCCACGTCC
ATGGTCGTCT CGTGGCTGAC CCGAACGGCC GTCCCTCGGC CCCAGGTACG GTTCGGTCCG
GCGGCTGGCG GATCTACCGG CTCGGTGACC GCGCTCACCC GTTCCTACAC GGACGCCCTG
ACAAACGAGG TGGTCTTCGC CCACCATGCG CACCTGTCCG GGCTCCTGCC GGCTGCCGAC
TACCGCTATG ACGTGGGTCA TGACGGGCGC TGGGGCTTGG CCCACGGGTC GTTCCGCACG
GCTCCGCGGC ATCGGGCCGC CTTCAGCTTC ACCTGCTTCG GCGACCAGGG CACCGACGAG
CCGCACGATC CGTACGGCTC GGCTGCGTCA CGCCACGTGA TAACCGGAGT GGAACGCCTC
GCGCCACTGT TCAACCTCGC GAACGGTGAC CTGTCGTACG CCAACCAGCG CACGGATCCG
GTTCGCGCCT GGTTCGACTG GTTCGCGATG ATCAGCGCCT CCGCCCGGTT CCGGCCGTGG
ATGCCCTGCA ACGGCAACCA TGAGACCGAG CGGGGCAACG GAGCTCTGGG GCTCGCCGCC
TACCAGACCT ACTTCGCCCT TCCCCAGCAC GACGAGGAGG CCTACCTCGC CGGGCTCTGG
TACGCGTTCA CCGTCGGCGG CGTGCGGTTC GTCATGCTCA GCGCCGCCGA CGTCTGCTAT
CAGGACAGCG GGCGGGTCTA CCTCCATGGG TACAGCGCCG GCCGGCAGAC CTCCTGGTTG
AGACAGACCC TCAAACAGGC CCGCGCCGAT CCCGGTATCG ACTGGATCGT CGTCGGCATG
CACCACGCCG CGGTGTCGAC CGCGGTGGAG CACAACGGCG CCGACCTCGG TATCCGGGAA
GAATGGCTGC CGTTGTTCGA CACCTACGAG GTGGATCTGG TGCTCTGCGG CCACGAGCAC
CACTACGAGC GCACTCATCC GCTGCGCGGG GTCGTGCCGG ACAGCGCGAC CCGGACTCCC
CGCCCGGTCC CCGGCGCGAC GACGCCCGCC CGGAAGACCG CTGACGGAGC GGGCGCCGCG
GCCGGTGACG GGGCCGGTGA CCTGCTCGAC ACCTCGGCGG GCACCGTCCA CCTGCTCGTG
GGCACCGGAG GATCCTCGTC GCCGTCCGCG CACGCACTGT TCGATCCACC CGCCTGCTGG
GTCATCGTCG GCGTGCACGA ACAGGATCCC GGCCGGTGGC ATCGCCAATC GGTCCGGGCG
AGAGAGGACG CGCCCTGGCT CGCCTTCCGG GCACCGGAGC ATCCATACGC CTTCGCGGCT
TTCGAGGTGG ATCCAGGTGA ACCGGGCGGC TCGACGAGCA TCCGGGTGAC CGTGTACGAC
TCCAGTGCAC CGACGCCGGT CCCGTTCGAC CGGTTCACCC TCGTCCGGCC ACGCGCCGAC
GCGGCCGTGC CCACCACCTG A
 
Protein sequence
MAGVAPRLLR PSQSRPRSRA GRSRAGTRSF SHTSLGRTAR SATGNRIWEN VSHRVTTLPA 
GASGEIGSGC RRRHEHRHNF IAPRTIRTAG SANGRAPLVT GSPDVALGVH LTFGSDPATS
MVVSWLTRTA VPRPQVRFGP AAGGSTGSVT ALTRSYTDAL TNEVVFAHHA HLSGLLPAAD
YRYDVGHDGR WGLAHGSFRT APRHRAAFSF TCFGDQGTDE PHDPYGSAAS RHVITGVERL
APLFNLANGD LSYANQRTDP VRAWFDWFAM ISASARFRPW MPCNGNHETE RGNGALGLAA
YQTYFALPQH DEEAYLAGLW YAFTVGGVRF VMLSAADVCY QDSGRVYLHG YSAGRQTSWL
RQTLKQARAD PGIDWIVVGM HHAAVSTAVE HNGADLGIRE EWLPLFDTYE VDLVLCGHEH
HYERTHPLRG VVPDSATRTP RPVPGATTPA RKTADGAGAA AGDGAGDLLD TSAGTVHLLV
GTGGSSSPSA HALFDPPACW VIVGVHEQDP GRWHRQSVRA REDAPWLAFR APEHPYAFAA
FEVDPGEPGG STSIRVTVYD SSAPTPVPFD RFTLVRPRAD AAVPTT
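
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how one might strip that layout, compute GC content, and translate the gene sequence. It uses only the standard library; the helper names are hypothetical. The codon table holds the standard amino acid assignments, which translation table 11 shares with table 1. Table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# also used by translation table 11); '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGCCGGAG TAGCCCCGCG GCTGCTCCGG CCATCGCAGT CTCGGCCGCG TAGCCGGGCT"
dna = clean_sequence(first_line)
print(translate(dna))  # MAGVAPRLLRPSQSRPRSRA
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.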