Gene Franean1_4096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4096 
Symbol 
ID5672454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4881394 
End bp4883061 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content70% 
IMG OID641242972 
Productpeptidase S15 
Protein accessionYP_001508389 
Protein GI158315881 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.127358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGAT GTGCGCCAAC GTTCATCCGC CGCTTCGCTC CGGTCGCCGC TCTGGCGGTC 
GCCGCCTCTT TGTCCGCATC GGCTTTGTCC GCAACGGCCG GTATCGCCGC CCCGTCGACA
TCCCCGCCCT CGCCCGTCCT CGCCCGGCCT TTATCGGCGG GCACACAGGC CTGGTGGAAC
TACGACCGTC CGGCCACCTA CCAGACCGTC TCCACCGAGA CCACAGTTCC GACTCGGGAC
GGCACCCCAC TCAGCTGCAC CCTGGTCCGG CCGGCGACCA ACGGTGTGCC GGCCGCGGGT
ACCTTCCCCG GCCTGGTCGT CGAGTTCACT CCGTATACGG CATTACGGGC GACCTATGTC
GCCAGTGAGG CCACCTACTT CGCGACCCAC GGCTACAACG CCCTGGTCTG CAACCTGCGG
GGTACCGGCG GATCAGGGGG CACCTGGCAG AACGCCATGT CGGCCCAGGA CGGCAAGGAC
GCCCGCGACC TGGTCGAATG GCTGGCCGTC CAGCCCTATT CGAACGGCCG GATCGGAATG
ACCGGCGAGA GCTACGGCGG TGACACGACC TACGCCGCCG CCATCAACCG GCCGCCGCAC
CTGGTCGCCA TCGCCCCGCT GCAGTCGCCG GCCGACCTCT ACAGCGACGT CATCTACCCC
GGTGGCATCA AGACCACCGA GGGCGGCAGC GTCGACAACT GGCCCGACAC CGCGCAGCTG
ATCAGCGGCG GAGCGATCAA CGCCGCGGCC GAGTACGCGG CCAACCGCGC CCACCCCACC
TACGACAGCT ACTGGCGGTC GCGGACGTTC GTCGGCCACC ACGACGAGAT CAATGTCCCG
ATCTTCGCGA TGGGCGGGTG GGTCGACCAG TACTTCCGCT CCGGCCAGCT CACCAACATC
GAAGGTGCTC TCAGCCGCAC GTGGGCCATC TACGGCCAGT GGCCGCACCG GGCCCCGCTG
CGGTACCCGA ACTGCCCGGG GCTCTGTAAC CCCGAAGGAC TTTCCCCGGG GATCACGCTC
GCCTGGTTCG ACCACTGGGT GCTGCGGCTT CCGAACGTCC CGATCCCACC CCAGCCGACC
CTGGTCAGCT ACGCCGGCGC CAGCCCCGCC ACGACCTCCT CAGGCTGGCG GGAGATCGTC
GGCTACCGGC CGACCGGCAC CCCCGTACGT GCGTTCACCC TCAACCCGGA CGGCTCGCTC
GGAGCGGCCA GCCTGACACC CGGAACCAGC ACGTTCCACC AGCCGCAGGA CCCGTCGACC
GCCGGGGGGT CCCTCCTCTT CCAGACCGAC GCGCTCACCG CACCGACGTC GCTGTTCGGC
CGCCCGACGC TCACACTGCA GGCGCGGCTG TCCGGCCCGG ACGCCAACCT CTACGTCGAA
CTGCTCGACG TCGGCCCGGA CGGTACCGAG ACGCTGGTCA ACAACGGCTT CCTCCGGGCC
AGCCACCGGA TCTCGAGCGT CGTGCCCACC CCGGTGCCGA CCGGGGCACC CGTCACCCTC
ACCGTTCCGA TCCGGCCGGA CGACTGGCAG TTCGGGGCCG GCCACCGCAT CGCCGTGCGC
GTCTCCGGCG GCGTCCCGAC GATGCTCACC CAGAATCCGA CCCCGGTCGA CGTCACCGTC
GCCACCGGCG TCGGCGGCTC CACCCTGACC CTTCCGACCG TCGCCTGA
 
Protein sequence
MARCAPTFIR RFAPVAALAV AASLSASALS ATAGIAAPST SPPSPVLARP LSAGTQAWWN 
YDRPATYQTV STETTVPTRD GTPLSCTLVR PATNGVPAAG TFPGLVVEFT PYTALRATYV
ASEATYFATH GYNALVCNLR GTGGSGGTWQ NAMSAQDGKD ARDLVEWLAV QPYSNGRIGM
TGESYGGDTT YAAAINRPPH LVAIAPLQSP ADLYSDVIYP GGIKTTEGGS VDNWPDTAQL
ISGGAINAAA EYAANRAHPT YDSYWRSRTF VGHHDEINVP IFAMGGWVDQ YFRSGQLTNI
EGALSRTWAI YGQWPHRAPL RYPNCPGLCN PEGLSPGITL AWFDHWVLRL PNVPIPPQPT
LVSYAGASPA TTSSGWREIV GYRPTGTPVR AFTLNPDGSL GAASLTPGTS TFHQPQDPST
AGGSLLFQTD ALTAPTSLFG RPTLTLQARL SGPDANLYVE LLDVGPDGTE TLVNNGFLRA
SHRISSVVPT PVPTGAPVTL TVPIRPDDWQ FGAGHRIAVR VSGGVPTMLT QNPTPVDVTV
ATGVGGSTLT LPTVA