Gene Franean1_0301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0301 
Symbol 
ID5668725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp356152 
End bp357639 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content79% 
IMG OID641239231 
ProductD-alanyl-D-alanine carboxypeptidase/D-alanyl-D-alanine-endopeptidase 
Protein accessionYP_001504673 
Protein GI158312165 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2027] D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) 
TIGRFAM ID[TIGR00666] D-alanyl-D-alanine carboxypeptidase, serine-type, PBP4 family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.306816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.359566 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCTGG GCGCGCGGGC CGGTGGCCTC ACGGCACTGG CCGGAGCGCT GCTCGCCGGA 
TCGCTCTCGC CCGGTCCGGC GGCGCCTCCG GGCGGCGCGT CACCGGCCCC GCCGGGTGTA
CGCGCCGAGG CGTCCGCGCT GCCGGGGCTG GACCCGGACG CGCCGAAGGC CGACCCGGCC
GCGGTGGCCG CGCGGCTGCT CGGGCCGCTC TCCGACCCCG TGCTGGGCGG CCCTGCCACG
CTGGTGGTCG ACGCGCTGAC CGGGCAGGTG CTTTTTGCGA ACCGCCCCAC CGAGCCGACC
GCTCCCGCAT CCACGGTCAA GATCGCCACG GCGACCGCGG CACTCACCGC GCTCGATCCG
GACGCCCGCC TGCGCACCCG CGCGGTCTAC CTCCCGCCGG CCAGCGCGCC ACCCGCGGCG
ACCGCAGGGT CACCTGCGGC AACCACCGGG TCGGCGGCTG CGCCCGGCCG GGCGCCCCTG
GCCCCAGGCG GCACGCTCTG GCTGGTCGGC GCGGGCGATC CGACCCTGAC CGCGGCGACC
GGGGCGGCCG GGTACCCGCC GGCGGCCCGG TTGAGCGACC TCGCCGAGCA GGTCCGCAAG
GCCGGCATCA CCGCTGTCGG CTCGGTGGTC GGCGACGGGA CGGCCTACGA GGGCCCGGGG
ATGGCGCCCG GCTGGCGCGA CGGCTACGTG ACCGACGGGA ACGTGACCCC GGTGTCCGCG
CTGTCGGTCG ACGCTGGCCG CGCGGCGCCC GGCTCCGCCG GGCCGCGCAG CCTGACCCCG
GACGCCGCCG CGGCGGCCGC GTTCGGCACG GCCCTCACCG CGGCCGGCGT GTCCGTCGGC
TCGGTCTCGA CGGGCCGGGC GGACGCGGCG GCCCGCGAGG TCGGGTCCGT GCAGAGCCCG
CCGATCCCGG TCCTGGTCGA GCGGATGCTC ACCGACTCCG ACAACGACCT GGCGGAAAGC
CTGGGACGTC AGGTCGCGAT CGCCCGGGGG CTTCCGGCGA GCTTCGACGG GGCCACCCGG
GGCGTGCTCG GCGCCCTTCG CGACGCGGGC ATCCCGACCG ACGGCGCCTC GCTGCGCGAC
ACCAGCGGAC TTTCGATCGA CAACCGGATC GCCCCGGCGA CGCTCGTCGC GGCGCTGCGG
ACGGCCGCCC TGCCGGGTCA TCCGGCCCTG CGGACGGTCC TGTCCGGGCT GCCGGTCGCG
GGCTTCACCG GCACGCTCGG CGACCGGTAC GGCCCCGGCG ACACCTCGCC GGGGGCCGGC
GTCGTCCGGG CGAAGACCGG CAGCCTGCGC ATCGTGACCA GCCTCGCCGG CATGGTGACC
GACAGCGACG GACGGCTGTT GCTGTTCGGG CTGTTCGCCC CGGTCGAGGA GCAGGGCCTG
ACCAAGATGG CGCTGGACCG GGTCGCGGCG GCCCTCGCCT CCTGCGGATG TCCGCCGGCC
GGCGCCGGGC CGCCATCAGC CGGGCCACCG GTGCCGCCGG GCGGCTAG
 
Protein sequence
MPLGARAGGL TALAGALLAG SLSPGPAAPP GGASPAPPGV RAEASALPGL DPDAPKADPA 
AVAARLLGPL SDPVLGGPAT LVVDALTGQV LFANRPTEPT APASTVKIAT ATAALTALDP
DARLRTRAVY LPPASAPPAA TAGSPAATTG SAAAPGRAPL APGGTLWLVG AGDPTLTAAT
GAAGYPPAAR LSDLAEQVRK AGITAVGSVV GDGTAYEGPG MAPGWRDGYV TDGNVTPVSA
LSVDAGRAAP GSAGPRSLTP DAAAAAAFGT ALTAAGVSVG SVSTGRADAA AREVGSVQSP
PIPVLVERML TDSDNDLAES LGRQVAIARG LPASFDGATR GVLGALRDAG IPTDGASLRD
TSGLSIDNRI APATLVAALR TAALPGHPAL RTVLSGLPVA GFTGTLGDRY GPGDTSPGAG
VVRAKTGSLR IVTSLAGMVT DSDGRLLLFG LFAPVEEQGL TKMALDRVAA ALASCGCPPA
GAGPPSAGPP VPPGG