Gene Franean1_1858 details


Gene Information

Locus tag: Franean1_1858
Symbol:
ID: 5670260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2231264
End bp: 2232871
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 76%
IMG OID: 641240779
Product: phosphoesterase PA-phosphatase related
Protein accession: YP_001506202
Protein GI: 158313694
COG category: [I] Lipid transport and metabolism; [R] General function prediction only
COG ID: [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase
TIGRFAM ID:
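
The gene and protein lengths above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS includes its stop codon. The record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2231264, 2232871)
print(length, protein_length_aa(length))  # 1608 535
```

Both results agree with the Gene Length and Protein Length fields in this record.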


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.832371
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00202108
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCGGCGGA TTCCGCGGCC CGTTCACCGT TCCCGCACCC GTGACCGTGA CCGTTCCCGC 
GACCTGGTCG GCGAGAGCGA TCTGCCCCGC GTCGGCGACC TGGCCCGCCT GCTCGTCCGG
CACGACGTCG CGATCGGCCG GCGTTTCGTG CACACCTTCA TCACGGTCGA CCGCGCGATG
TTCTCCGCCA TCGCCGGCGC CCGGCCGCTG GTGGACCCGC TGCTGCCCCG GCTCTCCCAC
GCGGCCGACC ACGGCATGCT CTGGTGGGGG GTGGCGGGCG CGCTGGGCGC GACGAAGGGC
CGCCGCCGCC CGGCGGCGAT GCGCGGCCTG CTCGCCCTGG GCGTCGCCAG CGCCGTCGCC
AACGGCCCGG CCAAGCTCCT GTTCCGCCGG GGCCGGCCGC CGACACACGG CATCCCGCCG
TTGCGCCGGC TGCGCCGTGA CCTGACGACC TTCTCGTTCC CGTCGGGGCA CTCGGCCTCG
GCGGCCGCCT TCGCCACCGG TGTCGCGCTC GACGCGCCCG CGGCGGCCGT CCCGGTGGTC
GCTCTGGCGT CCGCGGTCGC CTTCTCCCGG GTCTACGTCG GTGCCCACTA CCCGGGCGAC
GTGGTGGCCG GCGCCGCGCT CGGCATCGGC GCCGGGCTGC TGACGACGAA GGTGATGCCG
CGGCGCCCCT GGTCGCCGGC GCGGGCCCGG CCGGCGTCCG CCTGGGCCCC GGCCCTGCCG
GACGGGACGG GCCTGGGTGT GGTCGTCAAC GCCCGGTCCG GCGCCGGTCA CCACGCCGAG
CTCGCCGCCG TCCTGCGTGC CGACCTGCCC GGTGCGCAGG TCGTCGAGGT CGGCCCCGAC
CAGGACGTCG CCGAGGCCCT GGACCGCGTC GCCGCGCACA GCCGGGTCCT CGGCGTCGCC
GGCGGCGACG GGACGGTCAA CGCCGCGGCC GCGGCCGCTC TCGCCCGGGG CCTTCCGCTC
GCGGTGTTCC CGGCCGGCAC CCTCAACCAC TTCGCCGCCG ACGTCGGCCT GAACAGCGCC
GGTGACACGA TCAAGGCGGT CCGCGAAGGC ACGGCCGTCG CGGTGGACGT CGGCAGGGTC
GACGGCATCG GCGCGGCCGA CTCCCGGTTC AGCCGGATCT TCGTGAACAC CGCCAGCCTC
GGCGGCTACC CGGACATGGT CGGCATCCGG GAGCGTTTCG AGCGCCGGAT CGGCAAGTGG
CCGGCGATGA TCATCGCGCT GAGCCGGGTG CTGTGGAGCG ACCCGCCGTT CGACGTCGAG
ATCGACGGCG TCACCCGCCG GGTCTGGCTC GTCTTCGTCG GCAACGGCCG CTACCTGCCC
GACGGGTTCG CCCCGACATA CCGGACCCGC CTCGACGAGA GCCTGCTCGA CCTGCGGGTC
GTCGACGCGA CCGCGCCGCT GGCCCGGCTC CGCCTCGTCG GCGCGGTGCT GACCGGACGT
CTCGGCCGGT CACGGGTCTA CGAGCAGCGG ACGGTGGAAC GGGTGCGGAT CTCCTCCAGT
CAGCCGTCCC CGCTGCCGTT CGCCAGCGAC GGCGAGGTGA CCGAGGGGAT CCGCCGGATC
GCGGTCACGA CGAGCGGCGC CCGTCTCATC GTCTACCGGC CCGAGTAG
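
The GC content field in this record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown above. The function name `gc_content` is hypothetical, and how the source database rounds the percentage is not stated.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only; paste the full sequence to compare
# against the GC content field of the record.
first_line = "ATGCGGCGGA TTCCGCGGCC CGTTCACCGT TCCCGCACCC GTGACCGTGA CCGTTCCCGC"
print(f"{gc_content(first_line):.0%}")
```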
 
Protein sequence
MRRIPRPVHR SRTRDRDRSR DLVGESDLPR VGDLARLLVR HDVAIGRRFV HTFITVDRAM 
FSAIAGARPL VDPLLPRLSH AADHGMLWWG VAGALGATKG RRRPAAMRGL LALGVASAVA
NGPAKLLFRR GRPPTHGIPP LRRLRRDLTT FSFPSGHSAS AAAFATGVAL DAPAAAVPVV
ALASAVAFSR VYVGAHYPGD VVAGAALGIG AGLLTTKVMP RRPWSPARAR PASAWAPALP
DGTGLGVVVN ARSGAGHHAE LAAVLRADLP GAQVVEVGPD QDVAEALDRV AAHSRVLGVA
GGDGTVNAAA AAALARGLPL AVFPAGTLNH FAADVGLNSA GDTIKAVREG TAVAVDVGRV
DGIGAADSRF SRIFVNTASL GGYPDMVGIR ERFERRIGKW PAMIIALSRV LWSDPPFDVE
IDGVTRRVWL VFVGNGRYLP DGFAPTYRTR LDESLLDLRV VDATAPLARL RLVGAVLTGR
LGRSRVYEQR TVERVRISSS QPSPLPFASD GEVTEGIRRI AVTTSGARLI VYRPE