Gene Franean1_1288 details


Gene Information

Locus tag: Franean1_1288
Symbol:
ID: 5669701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1553891
End bp: 1555051
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 74%
IMG OID: 641240220
Product: hypothetical protein
Protein accession: YP_001505648
Protein GI: 158313140
COG category: [R] General function prediction only
COG ID: [COG3211] Predicted phosphatase
TIGRFAM ID:
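
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The final codon is the stop codon, so it does not encode a residue.
    return gene_length, codons - 1


gene_len, protein_len = cds_lengths(1553891, 1555051)
print(gene_len, protein_len)  # 1161 386
```

Under these assumptions the result agrees with the Gene Length (1161 bp) and Protein Length (386 aa) fields of this record.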


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.930248
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACCGCC GCACCGTGCT GCGCCTGGCG CTCGCGAGCG CCGGGGTGGC CGCTCTGTCC 
GGAGCGAGCT GGGCACCCGC GCTCGCCGCC ACCGCCCGAC CGGGCGAGGG CCCCTATGGC
CTCCCTCAGG CACCCGACGC CGACGGGGTC GCCCTGCCCC GCGGGTTCAC CAGCCGGGTG
GTCGCCCGTT CGCGCCAGGT CGTCCCCGGC ACGGATCTCG AGTGGCACGA CGCCCCGGAC
GGCGGCGCCT GCTTCCCCGC CGGAACGGGT TGGACCTATG TCTCCAACTC CGAGACGACG
TGGCGCGGGG GCGCCTCGGC GCTGCGCTTC GACGCCACCG GCACGATCGT CTCGGCCAGC
AGGATCCTGC GGCGTACCTC GGCCAACTGC TCGGGCGGGG CGACGCCGTG GGGCACCTGG
CTCTCCTGCG AGGAGCACGC GTTCGGGCAG GTGCACGAGA CATGGCCGGA CGGCCGCCGG
GACGCGGTGG CACGCCCCGC GATGGGGCGG TTCACCCACG AGGCGGCCGC CTGCGACCCC
GACCGCCAGG TCGTCTACCT CACCGAGGAC CGCCGCGACG GCTGCTTCTA CCGCTTCCGC
CCGGCCCGGT GGGGCGATCT GTCCGCGGGC GTCCTGGAGG TGCTGGTCGC ACCCGAGGAC
ACCGAGTCCG GTCCGGTGCG CTGGGCGCGG GTCCCCGACC CGGACGGCCT GCCGCGCTCG
ACCCGCAGGC AGGTCTCCGA CGCCCGCGCC TTCGACGGCG GCGAGGGCTG CTACTACGTG
GCGGGCACCT GCTTCTTCAC CACCAAGGGC GACAACCGGG TCTGGGCCTA CGACGCGGTC
GGCGAGCGGA TCTCCGTGCT CTACGACCCC GAGCAGGTCC CGCGCGGCGG AACGCGGATG
ACGGGCCCGG ACAACATCAC CGGATCGGCC GCCGGCGACC TGTTCATCGC CGAGGACAAC
CCCGGGCCGG CACTGCACAT GATCACAAGT GCCGGCGTCG TCTCCCGTTT CCTGCACCTG
CCGGACCATC GCCGCTCCGA GATCACCGGC CCGGCCTTCA GTCCCGACGG GCGCCGGCTG
TACTTCTCCT CCCAACGGGG GAAGGACGGG CGCGGCCGGA CCGGGATGAC CTTCGAGGTG
TCCGGTCCGT TCCGGCGGTG A
 
Protein sequence
MDRRTVLRLA LASAGVAALS GASWAPALAA TARPGEGPYG LPQAPDADGV ALPRGFTSRV 
VARSRQVVPG TDLEWHDAPD GGACFPAGTG WTYVSNSETT WRGGASALRF DATGTIVSAS
RILRRTSANC SGGATPWGTW LSCEEHAFGQ VHETWPDGRR DAVARPAMGR FTHEAAACDP
DRQVVYLTED RRDGCFYRFR PARWGDLSAG VLEVLVAPED TESGPVRWAR VPDPDGLPRS
TRRQVSDARA FDGGEGCYYV AGTCFFTTKG DNRVWAYDAV GERISVLYDP EQVPRGGTRM
TGPDNITGSA AGDLFIAEDN PGPALHMITS AGVVSRFLHL PDHRRSEITG PAFSPDGRRL
YFSSQRGKDG RGRTGMTFEV SGPFRR
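
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch in plain Python. It assumes the 10-base blocks are separated only by whitespace, uses the standard codon assignments (which table 11 shares with the standard code), and translates an alternative start codon such as GTG as M at the first position. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "GTGGACCGCC GCACCGTGCT GCGCCTGGCG CTCGCGAGCG CCGGGGTGGC CGCTCTGTCC"
print(translate(first_line))  # MDRRTVLRLALASAGVAALS
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content field listed in the record.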