Gene Francci3_0786 details

Gene Information

Locus tag: Francci3_0786
Symbol:
ID: 3905722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 915921
End bp: 917609
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 65%
IMG OID: 637878119
Product: peptidase S15
Protein accession: YP_479899
Protein GI: 86739499
COG category: [R] General function prediction only
COG ID: [COG2936] Predicted acyl esterases
TIGRFAM ID: [TIGR00976] putative hydrolase, CocE/NonD family
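
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical.

```python
def cds_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Francci3_0786.
gene_len = cds_length(915921, 917609)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # 1689 562
```

Under these assumptions the computed values agree with the Gene Length (1689 bp) and Protein Length (562 aa) fields.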


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.781394
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCACGG CTGCCAAGAC CACGAACGTG TTCGATTTCG CGCCCAATGA CAGAACACTG 
CTCGACGCGG AGATCGCCGC TCTCGAGAAT CTCGAGAGCG GCGACTTCAA GAATCTCCCG
CCGGCGACCG TGGCAAAGTT CGGGGAACTG CGGGGGGCGG GGACCTTCAC GGCGACCAGG
ATCCCCGGAG CCGATGCCGA CGTCCAACTC GACGCCGGCG TGCTGGTCCC CCTCAAGCCG
GGGCCGCACC CGGTTGTCAT CATGCCGGCT CCGCTGGCGC CGACGGGATG GAAGTCGTAT
CCGGGGATGC TCCTGAACTT CGCGCTCAAG GGATATCTGG CGGTCGCGTA CAGCGAGCGT
GGGATCGCCG ACTCCACAGG GAAGATAGAC GTCGCCGGTC CGCGGGACCG GGCGGACGGC
TCGGCCGTGA TCGACTGGGT ACTGAACACC TATCCCGACC GCGCGGATCA GGATCGGATC
GGGTTCGCGG GGTCGTCCTA CGGCGCCGGC CAGAGCCTGA TCATCGCGGC GCATGACGAC
CGGGTGCGGG CCGTGTGCGC GCAGAGCGCC TGGGCCGATC TGGGCCGATC GCTGTACGAG
AACGAGACAC GACACCTGCT GGCGTTCGAG GAGCTCGCCA AGCTGTTCGG CGAGGAGAAC
CTCTCCGCCG AAGTCAGGGA AATCTTCGAC GACTTCCGGG CGAATCGGAA TATCGAACGG
CTGCTGGAGT TCAGCGCGGT CCGTTCACCG GTTACCTATC TGAACGAGCT CAACGCCAGG
AGGGTCCCCA TCTTCCTCGG CACCTACTGG CACGAGACCA TCTTCTCGGT ACCCGCGGTG
GTGGAGTTCT TCAACGCATT GATCGGTCCC AAACGTCTCC TGGTACTGAT CGGAGATCAC
GGCGGCGACG AGATCAAGGG ATTTCTCGGC GGCATCAGTC GTCCGACAGC GACGACCTTC
CGCTGGCTGG ACCGCTTCGT TGCCGACGAG GAGAACGGCG TGGAGAACGA CGGCGACGTG
CACACCGAGT ACATGCACAA CCTGTTCACG ATCAAGAAGC TCGCCGACTG GGACTCGTAC
ACGCTGCCGC CGCGACGCTA CTACCTGAAC GCGCCGCTGC CGGGGACATC GGACGGCTCA
CTGACGCCTC AGCCCAACTC CGGGTGGAGC CACACCTTCC GGGCCGGCAC GGACACCCCA
GCACTGATAG CCCCGGCCCT GGTAAAGACC GGCGTCCTCG AAAGACTGGG GGCTCCGATC
GTGTACGAGA CCTCGGAGAT CTCCCGCGAG GACGCCGCGG TGTGGTCGAC CGAACCGCTC
ACCGCACCGA GTCAGATCAC CGGAACCGTC TCGGCGCATC TCACGGTCAC CCCGTCCGCG
AAGAGCGCGA CGCTCGTGGC GCACCTGTTC GACGTCGACC CGGGTACCGG TAAGGCAAAA
ATCATCACCA GCGCGCCCTT CACCCTGCTC AACGACCGCG AGGGAGAGAC GCAAACGATC
GACATCCACC TACAGCCGGC CGACTACCGC ATCACGCCGG GCCACCAGAT ACAGCTGGTC
GTGGACACCA AGGACAGGTT CTTCGGCGAC GCCACCGTCA CCAACAGCAA AATCGAGATC
TCGTCCACCG ACGGCTCCTC GTCCTACCTG AACATCCCGC TCGACGAGAT TCCGCTCGAG
AACGACTAG
 
Protein sequence
MTTAAKTTNV FDFAPNDRTL LDAEIAALEN LESGDFKNLP PATVAKFGEL RGAGTFTATR 
IPGADADVQL DAGVLVPLKP GPHPVVIMPA PLAPTGWKSY PGMLLNFALK GYLAVAYSER
GIADSTGKID VAGPRDRADG SAVIDWVLNT YPDRADQDRI GFAGSSYGAG QSLIIAAHDD
RVRAVCAQSA WADLGRSLYE NETRHLLAFE ELAKLFGEEN LSAEVREIFD DFRANRNIER
LLEFSAVRSP VTYLNELNAR RVPIFLGTYW HETIFSVPAV VEFFNALIGP KRLLVLIGDH
GGDEIKGFLG GISRPTATTF RWLDRFVADE ENGVENDGDV HTEYMHNLFT IKKLADWDSY
TLPPRRYYLN APLPGTSDGS LTPQPNSGWS HTFRAGTDTP ALIAPALVKT GVLERLGAPI
VYETSEISRE DAAVWSTEPL TAPSQITGTV SAHLTVTPSA KSATLVAHLF DVDPGTGKAK
IITSAPFTLL NDREGETQTI DIHLQPADYR ITPGHQIQLV VDTKDRFFGD ATVTNSKIEI
SSTDGSSSYL NIPLDEIPLE ND
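
The relationship between the gene sequence and the protein sequence can be sketched in code. The example below is a minimal illustration, not the database's own method: it builds the codon table shared by the standard code and translation table 11, reads an alternative start codon (such as the GTG that opens this gene) as methionine, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the start-codon set is the one listed for NCBI translation table 11.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order; table 11 assigns the same residues
# as the standard code and differs only in its permitted start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS, reading an initial start codon as M and stopping at a stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for idx in range(0, len(dna), 3):
        codon = dna[idx:idx + 3]
        if idx == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as grouped in the record.
first_bases = "GTGACCACGG CTGCCAAGAC CACGAACGTG"
print(translate(first_bases))  # MTTAAKTTNV
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` on the same input gives the value to compare against the GC content field.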