Gene Francci3_2157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2157 
Symbol 
ID3906757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2526089 
End bp2527858 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content73% 
IMG OID637879492 
Productmetal-dependent hydrolase 
Protein accessionYP_481258 
Protein GI86740858 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3454] Metal-dependent hydrolase involved in phosphonate metabolism
[COG3639] ABC-type phosphate/phosphonate transport system, permease component 
TIGRFAM ID[TIGR02318] phosphonate metabolism protein PhnM 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.321413 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCGC TGCTGGGGCT GCTCGGCCTG GCCGGGTGGG CCTTCGCCGC GCTGTCGATC 
AACGTCGCGA CGCTGATCGA CAGCGGGCGG AACGCGGCCG ACTTCGCCGC CCGGATGCTC
CCCCTGGACT TCCCCACCGC GGGGGAGCTG CTGCGCCTGA CCGGCCAGAC CATGTCGATC
GTCATCTGCG CCACGCTGCT GTCGGTGGTG CTCAGCACGG GGCTTGCGGT CTTGGCTGCC
GGGAACACGG CGCCGCACCG CGGTGCACGG TTCGGCGCGC GGACAGTGAT CGTCGCGGCC
CGCGCGGTGC CGGACGTGGT GCTGGCGACC GTGTTCTTTC GGCTGTTCGG GTTCGGCGCG
CTGGCCGGGG TTCTCGCGAT GGGACTGCAT TCGGTCGGCA TGGTCGGCAA GCTCTACGCC
GACGCGGTCG AGCAGATCGA CGAGGGCCCG CGTGAGGCCA TGCGCGCGGG CGGCGCGGGC
CGTGGGAGCT GGTGGCCGGC GTGCTGCCGC AGGTGCTGCC CGCGTTCGTG GCGACCGCGC
TGCACCGGCT GGACATCAAC CTGCGGGTCT CGGTCGTCCT CGGCTTCGTG GGTATCGACG
GGCTCGGCCG CGCCATCGCG ACCGGCTGCT GATCGTGTGT GTGCGGGGCA TCCCCGATCT
GGTGCTGGCG ATCGTGTTCG TGGTGATCAC CGGGCTCGGT GCGGTCGCCG GGGTGCTCGC
GCTGGGCGTC GGTGCTGTCG GGCTGCTCGG CAAGCTCGTC GCGGACTCCG TGGAAGAGGT
CGACCCGGGT GTCGAGGACG CGCTGCGCGC CACGGGATCG AGAACGAGCA GCTGCCCCGA
CCGGGCGCCG AGCTGCCGAT CGAGTTCGCG CTGCTGTCCT TCGAGGGCAA GCTGCGGGCC
GCGGGCGTGA CGACCGTGTT CCACGGCATC TCCTTCGAGG ACACCCACCA CGACATCCCG
CGCAGCGTGG GCCAGGCGGA GAAGACGTGC GAGGCGATCG ATGCTTATAC CGGCGGGCTC
GTCGACCACC GGATCCTGTA CCGGCTCGAC GTGCGCAGCC CGGAAGCGCT GTCCGCGCTG
GCGCGGCGGT TGGACCAGGT TCCCGACGGC GCCCTGGTCT CCCACGAGGA CCACACCCCG
GGCCAGGGCC AGTACGCCGT CCGCGAGCAC TACGAGCGCT ATTTGATGGG CAGCCGCGGC
ATGTCCGACG CGGAGGCTCG CGAGCATGTC GACCAGCTCA TCGCCGACCG GGACGGCAGG
CTCGACATCC GCGAGGAGGC CCTGGTCTGG CTCGCCGCGC GCTCGGCGCG AATCCGGCTG
CTCGGGCACG ATCCCAGCTC GGCCGCCGAG ATCGCCGAGC TGCGGGACCG CGGCTGCGCG
GTCGCCGAGT TCCCCACCAC GATCGAAGCG GCGGAAGCCG CCCGCGCCCA CGGACTTCCC
GTCGTCATAG GCGCCCCGAA CATCCTGCGG GGACGTTCCC ACAACGGCAA CGCCTCCGGC
CGTGACCTGG TCGGCCGCGG CCTGGTGACC GCGCTGGTGT CGGACTACCT GCCCTCCGGT
CTGCTCGCGG CCGCCATGCT GCTCGCCGAG CAGGGGCTCG CCACCCTGCC GGCCGCCATC
GGCCTGGTGA CCGCCGGGCC GGCCGAGGTC GCCGGACTTC CCGACCGCGG TCGGCTCGAG
CCGGGGCTGC GCGCTGACTT TGTGCTGGTC GAGCCGCGCC GCCCGTGGCC GGTGGTGCGG
TCCGTGTTGT CATCCTGGGG TGTCTGGTGA
 
Protein sequence
MAALLGLLGL AGWAFAALSI NVATLIDSGR NAADFAARML PLDFPTAGEL LRLTGQTMSI 
VICATLLSVV LSTGLAVLAA GNTAPHRGAR FGARTVIVAA RAVPDVVLAT VFFRLFGFGA
LAGVLAMGLH SVGMVGKLYA DAVEQIDEGP REAMRAGGAG RGSWWPACCR RCCPRSWRPR
CTGWTSTCGS RSSSASWVST GSAAPSRPAA DRVCAGHPRS GAGDRVRGDH RARCGRRGAR
AGRRCCRAAR QARRGLRGRG RPGCRGRAAR HGIENEQLPR PGAELPIEFA LLSFEGKLRA
AGVTTVFHGI SFEDTHHDIP RSVGQAEKTC EAIDAYTGGL VDHRILYRLD VRSPEALSAL
ARRLDQVPDG ALVSHEDHTP GQGQYAVREH YERYLMGSRG MSDAEAREHV DQLIADRDGR
LDIREEALVW LAARSARIRL LGHDPSSAAE IAELRDRGCA VAEFPTTIEA AEAARAHGLP
VVIGAPNILR GRSHNGNASG RDLVGRGLVT ALVSDYLPSG LLAAAMLLAE QGLATLPAAI
GLVTAGPAEV AGLPDRGRLE PGLRADFVLV EPRRPWPVVR SVLSSWGVW