Gene Franean1_0188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0188 
Symbol 
ID5668613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp229619 
End bp231301 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content75% 
IMG OID641239117 
Productprotein serine/threonine phosphatase 
Protein accessionYP_001504561 
Protein GI158312053 
COG category[T] Signal transduction mechanisms 
COG ID[COG0631] Serine/threonine protein phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.108633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.365924 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAGCT CACTGGTACC GATCATGCTC ATCGTGGTGC TTCTGGTGGG CGCGGTGGTT 
TTCGTCGTCC TGTCCAGTCG CGCACCACGC ACCGGCTACG GATCCGGGCG CAAGCGCTCG
CTGGAGACGA GACGCTCGCG GCCCGGCAGC ATCCCGCTGC GCGACGCCCC GGAGCCGCCG
GCTCCGGCCG CGCCGGACAC GCGCGCGCCG GACCCGCAGT CACCAGACGG CGTCGTGCCC
GGCTTCACCG CGCCCGCGAG CGCTCCCGCC GCGGTGGAGG TGGCGCAGGC CGCGGACGGC
ACGAGCCGCG GCACCGGCGG CGCGGCCGAT CCGGGCCCGG ACACCGGCTC CGCCTGGAGT
GACGGTTCCG CCTGGGGCGA CAGCGGGTGG GGCGACGCCG ACGCCACCCG CATCGACAGT
CTGCGCGATC CCGGCCACGG GCCTCCCGGC ACGGCGGGAA GTACAGCGAA TGACACCGAC
ACGGTCGCGT CCGCTACGGA AAGCGGCGGG ACCGGAGGTC ACAATATCGG TGGCGCGTAC
ACGTCACGCG CCGATACCGG CGCGGCCCCG GCGGATCCGT CCCCTGCGGG GGCGTACTTC
GCCGGAGCCG CACAGCCCGA CGGGCCCCAC GCCGGTGCCC ACGGCACGGG GGGTATCGAC
GTGAGTGGAC TCGACTCAGG CGGCAGGGAC ACCTACATGA ACGACAGCCG AGACCCGGAC
GCCACCGGGT GGACTCCGCC CCCCAGCGGG CGCCCGCACC ATGGCGAGAC GGACGACGCC
GCCTACCGGC TCGCGGTCGA CGACCGCGAC CCGCTCGCCG GGGGCTTCCC GCACCAGGGG
CCGTCGGGGT ACTCCGGACC GGCCTCCTAC CCCGACCCGA CGGCCGGCGA GACGATGCGG
ATCGGCTCGA CCGCGCCGAC CGTGCCCACC CCCGACCCCT CCGCGCCCGG TCTGGTCGCG
GCCGAGCCGG CCCAGACCAC GGCTCCCGCC CTGCGGCTGG CCGCGGCCGG CCGCACCCGG
CGCGGCAAGC GCGGCGGCCC CAACGAGGAC GCCTACGTCG TGTCCGACGG CCTGCTCGCC
GTCGCCGACG GCGTGGGCGG GGAGGCCGCC GGCCAGATCG CCTCGACGCT GACGGTCACG
ACGGTGGCCG GGTTCCGTCC CCAGTACGCC GAGGACCCGG CCGACGGGCT GCGCCAGGCG
GTCGCCCGGG CGAACCAGGT CGTGCGGGAG AAGCCCCGGC AGGAGCCGTC CTGGCGAGGC
ATGGCCTGCA CCCTCGACGT GGTCATCCTC GGCCGCCAGC AGTCGACGGG CGAGACACTG
TTCATCGCCC ACGTCGGCGA CAGCTCCGTC TGGCTGCAGC CGCAGAAGGG CCGACCGCGC
CAGGTCACGA CGCCGCACGC CATCAAGAAC GGCCCGCTGC TCAACGCCAT AGGCCTGGCC
GACCGGATCG AGGCCGACAT CCTGCGCGAG GCGGTGCGGT CCGGTGACCG GGTGATCCTC
GCCAGTGACG GCATCACGAA GGTGATGACC CCGGAGCAGC TGCTGGGGCT GATGACCGAG
CTCGGCCCGG AACCACCGGA GCGGGCCGCG GACGCCCTGG TCGAGGCGGC GCTGCTGGCC
GGCGCCCGCG ACGACACGAC CATTGTCGTC GCCGACCTGG TCTCGGAGCC GTCGGCGCGA
TGA
 
Protein sequence
MGSSLVPIML IVVLLVGAVV FVVLSSRAPR TGYGSGRKRS LETRRSRPGS IPLRDAPEPP 
APAAPDTRAP DPQSPDGVVP GFTAPASAPA AVEVAQAADG TSRGTGGAAD PGPDTGSAWS
DGSAWGDSGW GDADATRIDS LRDPGHGPPG TAGSTANDTD TVASATESGG TGGHNIGGAY
TSRADTGAAP ADPSPAGAYF AGAAQPDGPH AGAHGTGGID VSGLDSGGRD TYMNDSRDPD
ATGWTPPPSG RPHHGETDDA AYRLAVDDRD PLAGGFPHQG PSGYSGPASY PDPTAGETMR
IGSTAPTVPT PDPSAPGLVA AEPAQTTAPA LRLAAAGRTR RGKRGGPNED AYVVSDGLLA
VADGVGGEAA GQIASTLTVT TVAGFRPQYA EDPADGLRQA VARANQVVRE KPRQEPSWRG
MACTLDVVIL GRQQSTGETL FIAHVGDSSV WLQPQKGRPR QVTTPHAIKN GPLLNAIGLA
DRIEADILRE AVRSGDRVIL ASDGITKVMT PEQLLGLMTE LGPEPPERAA DALVEAALLA
GARDDTTIVV ADLVSEPSAR