Gene Franean1_0937 details


Gene Information

Locus tag: Franean1_0937
Symbol:
ID: 5669351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1095245
End bp: 1097077
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 76%
IMG OID: 641239864
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_001505299
Protein GI: 158312791
COG category: [R] General function prediction only
COG ID: [COG1524] Uncharacterized proteins of the AP superfamily
TIGRFAM ID:
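
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a stop codon; the function names are hypothetical and not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(1095245, 1097077)
    print(length)                  # 1833
    print(protein_length(length))  # 610
```

Both values agree with the Gene Length and Protein Length fields of this record.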


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0114509
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.737923
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCAGCA GCGAGCGCGT GTATCCGGTC CGCAGCGGTG TCGCTCCCGA CCGCGTTGCC 
ACCGTCGCCG AGGCGCCGGC CGGCGCTCCC GTGCAGGCCG TCCCCGCCGG CAGGCCTCCC
TCGGGCACCG TGACGACCAC CGCGGCCGGG GATGCCATCG AGGCCGGGGT GGCCGGGGTG
GCCGGGTCGG TCAGGGCGGT CGCGGAGCCC GGGCTGGCAC AGGCGGTGGC CGTGCTCACC
CGGCCCGACC TCGCCCACGT CGTCGACCTG GTCGCGTGGG TCGAGGACGG CTGGTTGCAC
GTCGCGAACG CCGACGGTGC GTCGCGCCTG CCGGTCGACG ATCCGGACGG GCCGTGGGAG
ATCCTGCGTG GCCGGGACCC GGTCGCCGAC CAGGACCCCA TGCACGGTGT TCCGCTGGTG
GCCGCGCTCG CCGACCCGTC CCCGCCGGCC GCGCGCAACG CCTACCCCTT CGCGGGACGG
CGCCTGCTGT CCATGTTCGC CGACCCGACG CGCTCGCCGG ACATCGCCGT CGTCCACACC
CCGCGGCACT ACTGGCCCGA GCGGGGCGGC CATCTGGGGG AGCACGGCTC GCTGGACGCC
GGGCAGTCCC GGGCGCCGCT CGTTCTGTCC GGGGCAGGTG TGACCGCCCG TGGCCTGCTG
CCGCGGGTCG CCCGGGTGAT CGACGTCGGT CCGACCCTGG CGGCGCTCGC CGGGGCGGCG
ATGCCCGAGG CGGAGGGAAC CGCGCTCGAC GATCTCGCCG GGCCGGGAGC CCGGCATGTG
GTCGGGCTGC TGTGGGACGG GACGAACTGC AACGACCTGC TCGACCTCGC CGCCCGGGGA
GAGCTCCCGA ACGTCGCCCG CCTGCTGGCC CGCGGCGTCG CGCTGACCGG CGGCGCCCTG
GCCGAGTTCC CGAGCGTGAC CCTCACCAAC CACACCTCGG CGATCACCGG CGTCGGCCCC
GGCCGGCACG GCATTCTGCA CAACGTCTAC TTCGACCGCG CGACCGACCG CCAGGTGATC
ACCAACGAGG CCGCGACCTG GCACGTCGCC TGCGACCAGC TCCGGGACGG CGTGTCGACG
GTGTTCGAGG CGGTGGCCCG CTCCCGCCCC GGGACCGAGA CGGCCTGCGT GAACGAGCCG
ATCGACCGCG GGGCGAGCTA CTCGACGTTC GGCCTGGTGC GAGCCCTCGG GGTCGGGCCG
GGCGCCGGCG GTGCCGGTGC CGAGGCCGCC GGTGGCGGGA TGGAGGACTA CCTGCCGGCC
GCCGAGGGCG ACCCGCACGC GAGCGCCGAA TGGGTCACCG CCGACCCGAA CTACGCCTGG
TCGACAAGGG TGGACGCGCT GGGGCTGACC CAGATCACCG ACCTGTGGGC CGACGGCCGG
GAACCACCTG TGCTGACCTG GTGGAACACC ACCATCACCG ACACCGGTCA TCACGGCGGC
GGGCCCTACT CACCCGAGGC CCGCGCCGCG CTGGCCGACG CCGACCGCCG GCTGGGGGTG
TTCCTCGACC TGGTGGAGCG CCGTGGCCTC ACCGACCAGA CGGCCATCCT GCTCACCGCC
GACCACGGAT TCGAGGCGGC CGACCCCGAC TGCCGCGGTG ACTGGGACGT CGCCCTGCAC
CGCGCCGGGG TGGTCTTCCG GGACGAGGGC TACGGCTTCA TCTACCTGGG CCTCGCCGGC
GACGACGAAG CCCCGGCCGA CTCCGCTGAT CCGGGCAGCC CGGCCAACCT CGGAGACCGG
GCCACGCCAG GTGGCCCGGC GGGCCAGGGT GGGCCGGGCG GCTCGGGGGC CCAGGCAACG
GCCTCCCTGC CGCCGCAGCC GCCAGGCACC TGA
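
The GC content field can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown here; `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First block of the gene sequence only; paste the full sequence
    # to compare against the GC content reported in the record.
    print(f"{gc_content('GTGCCCAGCA'):.0%}")
```

Applied to the complete sequence, the result should be compared with the 76% given in the Gene Information section.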
 
Protein sequence
MPSSERVYPV RSGVAPDRVA TVAEAPAGAP VQAVPAGRPP SGTVTTTAAG DAIEAGVAGV 
AGSVRAVAEP GLAQAVAVLT RPDLAHVVDL VAWVEDGWLH VANADGASRL PVDDPDGPWE
ILRGRDPVAD QDPMHGVPLV AALADPSPPA ARNAYPFAGR RLLSMFADPT RSPDIAVVHT
PRHYWPERGG HLGEHGSLDA GQSRAPLVLS GAGVTARGLL PRVARVIDVG PTLAALAGAA
MPEAEGTALD DLAGPGARHV VGLLWDGTNC NDLLDLAARG ELPNVARLLA RGVALTGGAL
AEFPSVTLTN HTSAITGVGP GRHGILHNVY FDRATDRQVI TNEAATWHVA CDQLRDGVST
VFEAVARSRP GTETACVNEP IDRGASYSTF GLVRALGVGP GAGGAGAEAA GGGMEDYLPA
AEGDPHASAE WVTADPNYAW STRVDALGLT QITDLWADGR EPPVLTWWNT TITDTGHHGG
GPYSPEARAA LADADRRLGV FLDLVERRGL TDQTAILLTA DHGFEAADPD CRGDWDVALH
RAGVVFRDEG YGFIYLGLAG DDEAPADSAD PGSPANLGDR ATPGGPAGQG GPGGSGAQAT
ASLPPQPPGT
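
The protein sequence can be derived from the gene sequence with translation table 11 (bacterial, archaeal and plant plastid code). The sketch below assumes the gene sequence is given on the coding strand and in frame; `translate` and the constant names are hypothetical. Note that the gene starts with GTG, an alternative start codon in table 11, which is read as methionine at the initiation position.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
# Initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any valid start codon initiates with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence.
    print(translate("GTGCCCAGCA GCGAGCGCGT GTATCCGGTC"))  # MPSSERVYPV
```

The output matches the first ten residues of the protein sequence listed above.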