Gene Francci3_3059 details


Gene Information

Locus tag: Francci3_3059
Symbol:
ID: 3904260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3627569
End bp: 3629170
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 74%
IMG OID: 637880380
Product: phosphoesterase, PA-phosphatase related
Protein accession: YP_482145
Protein GI: 86741745
COG category: [I] Lipid transport and metabolism
              [R] General function prediction only
COG ID: [COG0671] Membrane-associated phospholipid phosphatase
        [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase
TIGRFAM ID:
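
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3627569
end_bp = 3629170
gene_length_bp = 1602
protein_length_aa = 533

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so the protein has one residue
# fewer than the number of codons.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1602 0 533
```

Under those assumptions the span matches the stated gene length, the length is a whole number of codons, and the codon count minus the stop codon matches the stated protein length.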


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGTCAA GGTCGGTGAC GAGGCCGGGA AACGGGACCG CTCCCCGCTG GCCCGCGGAG 
AACCGGGTCG GTCGGCGCGT CCGGATCCTC ATGCGCCGTG ACATCGCGCT GGGCCGGACG
TTCCTGCACG CCTTCACCAG TGCGGACCGG TCGCTGTTCG CGGCGCTGGC CGGCGGCAGA
CCGCTGCTCG ACCCGGCGCT GCCGCGGCTG TCCCACGCTG CCGATCATGG TCTGCTCTGG
TGGGGGGTGG CCGGGGCGCT CGGTGCGACG AAGGGACGCC GTCGCCCGGC GGCCGTCCGT
GGCCTGCTCG CCCTGGGAGT CGCGAGCGTT CTCGCCAACG GGCCGATGAA GGTGGTGTTC
CGCCGGGACC GCCCACCGAC TCACACGATT CCCCCGCTGC GGCGGCTTCG CGAGGATCTC
ACGACGTTCT CGTTCCCCTC TGGGCACGCG GCGTCGGCAG CCGCCTTCGC CACCGGAGTC
GCGTTGGACG CACCCGGCGC GGCGGTGCCG GTGGCTGTGC TCGCCGCCGC GGTGGCGTTC
TCCCGGGTGT ACGTCGGGGT GCACTATCCC GGGGACGTGG CCGCCGGCGT GCTGCTGGGG
ATAGGGGCGG GCCTGGCGAC GACGAAGGTG ATGCCGCGTC GGCCGTGGGC CCCCGCCCGG
GCCAGCCCCG CGTCGGCGTG GGCGCCCGCC CTGCCCGACG GCGACGGCCT TACCGTGGTG
GTGAACGCCC GTTCCGGTCC GGGTAACCAC ACGGACCTGC TCGCCGTGCT GCGGGCCGAC
CTGCCCAGGG CCCGGGTCGT CGAGGTCGAC GCGGGCGGCG ACGTAAGAAC CGTGCTGAGG
TCCGCGGCGG CGCGGTCCCG TGTGCTGGGC GTCGCCGGCG GCGACGGCAC CATCAACGCG
GCCGCCCAGA CCGCGCTGGC GCACGGCGTG CCGCTGGCGG TCTTCCCCGC GGGCACGCTC
AACCACTTCG CCGCGGACGT CGGACTCGCC GGGGCGGGTG ACTCGGTGCA GGCGATACGG
GAGGGTTCGG CGGTCGCCGT CGACATCGGC CGGGCCGAGG GCATCGGCGC GACGTTCAGT
CGGTTCTCCC GCATCTTCGT GAACACCGCG AGCCTCGGCG GTTACCCGGA CATGGTCGCC
ATCCGTGCGC GGTTCGAGCG CCGCATCGGC AAGTGGCCGG CGATGCTCAT CGCGCTGAGC
TGGGTGCTGC GCCACGAAAC GCCGTTCGAG GTCGAGATCG ACTCCGAGTA CCGCCGGGTC
TGGCTGATCT TCGTCGGCAA CGGGATCTAC CAGCCCGACG GGTTCGCCCC CACCTACCGC
ACCCGCCTCG ACGAAGGGCT GCTGGACCTG CGGGTGGTCG ACGCCGCGGC GTCGCTTGCC
CGGCTCCGGC TGGTGGGGGC GGTGCTGACC GGCCGTCTCG GGCGCAGCCG GGTGTACGAA
CAACACACGG TCGAACGGGT CACGATCTCA TCCCGTCAGC CCGGCCCGCT GCCGTTCGCC
TGTGACGGCG AGGTCACGGA GGGAGTGGAA CGCATCGTCA TCACTCCGGG CGGAGCCCGA
CTGATCGTCT ACCGGCCGCG GCGGCCCGGC GCATCCGGCT GA
 
Protein sequence
MASRSVTRPG NGTAPRWPAE NRVGRRVRIL MRRDIALGRT FLHAFTSADR SLFAALAGGR 
PLLDPALPRL SHAADHGLLW WGVAGALGAT KGRRRPAAVR GLLALGVASV LANGPMKVVF
RRDRPPTHTI PPLRRLREDL TTFSFPSGHA ASAAAFATGV ALDAPGAAVP VAVLAAAVAF
SRVYVGVHYP GDVAAGVLLG IGAGLATTKV MPRRPWAPAR ASPASAWAPA LPDGDGLTVV
VNARSGPGNH TDLLAVLRAD LPRARVVEVD AGGDVRTVLR SAAARSRVLG VAGGDGTINA
AAQTALAHGV PLAVFPAGTL NHFAADVGLA GAGDSVQAIR EGSAVAVDIG RAEGIGATFS
RFSRIFVNTA SLGGYPDMVA IRARFERRIG KWPAMLIALS WVLRHETPFE VEIDSEYRRV
WLIFVGNGIY QPDGFAPTYR TRLDEGLLDL RVVDAAASLA RLRLVGAVLT GRLGRSRVYE
QHTVERVTIS SRQPGPLPFA CDGEVTEGVE RIVITPGGAR LIVYRPRRPG ASG
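
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG can serve as an initiator codon read as methionine. The sketch below illustrates how the two sequences relate. It is not the tool that produced this record: the function names are hypothetical, the codon table is built from the standard amino acid assignments (which table 11 shares), and the initiator codon set is the table 11 set as commonly documented by NCBI and should be checked against that source before being relied on.

```python
# Standard codon assignments in TCAG order; table 11 uses the same amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: initiator codons for translation table 11 (read as Met at position 1).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a CDS, reading an initiator codon as M and stopping at a stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCGTCAA GGTCGGTGAC GAGGCCGGGA"))  # MASRSVTRPG
```

Passing the full gene sequence to `translate_cds` should yield a protein that can be compared with the listed protein sequence, and `gc_fraction` gives a value that can be compared with the GC content field.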