Gene Francci3_4235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4235 
Symbol 
ID3907201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5053309 
End bp5054922 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content73% 
IMG OID637881561 
Productpeptidase S15 
Protein accessionYP_483310 
Protein GI86742910 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.59607 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGCCA TCGACGCGAA CCAGGCCGTA CCGGCGGCGG ACGGCGTCCT GCTGGCGACC 
GACGTCTACC GTCCCGACCG GCTACCCGCC CCGGCGGTGG TCACTCGCAC CCCGTACGGT
CGCGGTTCGC TGCTGGCAAA CGGCGTCGGG TGGGCGCGGA ACGGACTGGC CTACGTTGCC
CAGGACGTGC GGGGGCGCTA TGGGTCGGGC GGAACCTGGA CCCCGTATCA AGGGGAGCGC
GCCGATGGCC GGGCGTTGGT CGAATGGGTC CACCGCCAGC CCTGGTGCGA TGGGAACGTG
ATCCTCGCCG GAGCCTCCTA CGGCTCGTTC ACCGCGTGGG CAGCCGCCGT CACCGTTCCC
GAGCTCGTGC GCGCAGTGAT CAGCGAGGTA CCCGCCGCGG GTCTGCGGCC CGCCAACGTG
GACCCGTCGG GGATCCTGCG GCTGGCCGAG TACGCCGGCT GGTGGGCCGA GCACGCCGAG
AGCCGCACCA GCAGGAACGG GCTGTCCGCG CAGATGCTGG GCTGTGAGCC GGACCTGCTG
CGGCACCTGC CGGTAGCTGA CCTCGGCCGG CACCTCTGGG CACGGGTGCC ACGCTGGTGG
AGCGCCATAG CCCCGGCTCT GTCAGCCCCG GCTCTGTCAG CCCCGGCTCC GTTGACGACC
GGCGACAGCC CGGCACCGCA TGACCGTACT GGCGACGACC TCGGGGAGGG CATCAGCACG
CAGGAACTGG CCCGCTGCTC GCTGCCGTCC CTGCACATCG GCGGCTGGTA CGATCTCTTC
CTGCCACAGA CGCTGTGGCA GTGGGAAACC GCGGGCCGCG ACCGCGCTCC GAACAGGCCC
GCCCGGGGCC TGGTGATCGG GCCGTGGGGG CACGAGCTGT CGACCCCCGC TTCCAGCTCG
GCGGGTGGGC GGGAGCACGG GCCCGCCTCG CAACTGCCGC TGGGACGCCT CCAGGTCGCG
TGGATCTTTG ACGTGCTGGC CGGCCGGGAT GCGTCGATCA CCAAGGTGTT CCTCGTCGAG
GGCGGACGCT GGCTGGATCG GTGGCCGGCG TCCACCGCCA CCCTGGGCCT GCAGGCCAGC
GCTGACGGGT CACTCCTGCC GAACCCGCCC GAGCGGCCCG CCGAGCACCG GTTCACCTAC
GACCCGCTCG ACCCCTTCCC CAGCCTCCCG CGGGACTGCG ACCGTGCCCC CCTGGACGCC
CGCGCCGACG CCGTGGCATT CCGGACCCCG CCACTGACGA CGCCGACTGC CATCGTCGGC
GCACCCACCG TCACGATGGC CGCAGACACC ACGGGCCCCG GCACCGACTG GATAGTCCGG
CTGGTGGAGC GGCTCGGTGA CGGCCGGGCC TTGGAGGTCA CCAGCGGCGC CGCTGCCGTC
GGGCCCGGCG CGGCCACGGT GTCGATCCCG CTCGGCGCCA CGGCCCTGCT GCTCCACCCC
GGCAGCCGGC TGGAGCTGCA GGTCACCAGC AGCGACTTCC CGCGGCTGGC CCGCACCCCC
AACACCGGCC AGGACCGGTA CACCACCAGC GCCACCCGGA TCGCCACCCA GACCATCCAC
ACCGGTCCAA CCCGCGGCTG CCGGGTCGAC CTGCCCGTGC TGGAGCACCC GTGA
 
Protein sequence
MHAIDANQAV PAADGVLLAT DVYRPDRLPA PAVVTRTPYG RGSLLANGVG WARNGLAYVA 
QDVRGRYGSG GTWTPYQGER ADGRALVEWV HRQPWCDGNV ILAGASYGSF TAWAAAVTVP
ELVRAVISEV PAAGLRPANV DPSGILRLAE YAGWWAEHAE SRTSRNGLSA QMLGCEPDLL
RHLPVADLGR HLWARVPRWW SAIAPALSAP ALSAPAPLTT GDSPAPHDRT GDDLGEGIST
QELARCSLPS LHIGGWYDLF LPQTLWQWET AGRDRAPNRP ARGLVIGPWG HELSTPASSS
AGGREHGPAS QLPLGRLQVA WIFDVLAGRD ASITKVFLVE GGRWLDRWPA STATLGLQAS
ADGSLLPNPP ERPAEHRFTY DPLDPFPSLP RDCDRAPLDA RADAVAFRTP PLTTPTAIVG
APTVTMAADT TGPGTDWIVR LVERLGDGRA LEVTSGAAAV GPGAATVSIP LGATALLLHP
GSRLELQVTS SDFPRLARTP NTGQDRYTTS ATRIATQTIH TGPTRGCRVD LPVLEHP