Gene Francci3_4311 details

Gene Information

Locus tag: Francci3_4311
Symbol:
ID: 3907280
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5149597
End bp: 5151048
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 75%
IMG OID: 637881639
Product: D-alanyl-D-alanine carboxypeptidase/D-alanyl-D-alanine-endopeptidase
Protein accession: YP_483386
Protein GI: 86742986
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2027] D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4)
TIGRFAM ID: [TIGR00666] D-alanyl-D-alanine carboxypeptidase, serine-type, PBP4 family
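
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length counts one terminal stop codon that does not appear in the protein; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5149597
end_bp = 5151048

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values can be compared directly with the Gene Length and Protein Length fields.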


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.654981
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGTCAG CCGCACCGGT CGCCCGGGTA GGGGGGCTCA CCGCCCTGGC CGGCGCGCTG 
CTCGCCGGGT CCCTGTCCGT CGGTCCCACC ACCGGATCAG CCGCGGAACT CGCCGGCAGC
GGGTCCGACC GCGCGGGCGC TGCGGTGCTC GGCGGTCTCG ATTCGGCCGC GCCGATCCCG
CAGCCCGCGG CGGTGGCGGC CCGCCTCGCC GGCCCGCTGG GTGATCCCGT TCTGGCGCGG
CCCGCGGCCT TCGTCATAGA CGCACAGTCG GGTCGGGTGC TCCTCGACGT CCGGTCCCGC
ATCCCCGTGG CACCGGCCTC CACGTTGAAA ACCGCGGTCG CCACGGCCGC GTTGACCACC
TTCCCCGACG ACCGGCTGCG GACCACCGTG GTCTACAGCC CGCCCGCGGG CGGGAACGCC
CGCACATCAG GCGGAACACT GTGGCTCGTC GGCGGCGGAG ATCCGACCCT GACCGCCTCC
ACCGCGCCCG GCGGCTATCC GGCTCTGGCC CGGCTGAGCG ATCTCGCCGC TCAGGTCCGG
GCCGCGGGTA TCACCTCGGC GGCTCGGGTG ATCGGCGATG GAGGGCTGTT CGTCGGTCCG
GACCGGGCAC CGGGATGGCG CGACAACTAC GTGACCGACG GGGACGTCGC CCCGGTCTCG
GCGCTGGAGG TCGACGCCGG ACGGTCGGCA CCGGAAGCCG TCGGACCGCG CAGCCCCACC
CCGGCCGCCG CCGCGACCAC GGCCTTCGCC GCGGCGCTCA TGACCGCCGG AGTATCCGTC
GGATCGGTGG CCACCGGCCC GGCGGACCCG GCCGCACAAC GGGTCGCGGC GGTCTACAGC
CCGCCGGTGC GGGTCTTGGT CGAACGCATG CTCACCCAGT CAGACAACGA TCTCGCCGAG
AGCCTGGGAC GGCTCGTCGC CCATCGGAGG GGCCTGCCCG CCTCGTTCAC CGGCGCCACC
CGCGCGGTGA GGGACGTCCT CGCCGAGGCG GGCCTCCCCG TCGCAGGGAT GGCACTGGCC
GACGTGAGCG GCCTGTCCAT CGCAAACCTG ATCATGCCCA TGACCCTTGT GGCCATCCTG
CGCGCCGCCG TCCTCCCCGG TCGGCCGGCG CTGCGCACCA TTCTCACGGG GCTGCCGGTA
GCCGGGTTCT CCGGTACGCT AGGTGACCGC TACACCACTG GCGACACCGC CATCGGAGCG
GGTGACGTCC GGGCGAAGAC CGGCACTCTG CACAACGTCA GCAGCCTCGC GGGCCAGCTC
GTCGACGCGG ACGGCAGGCT GCTCCTCTTC GCGTTCCTCT CGCCCGCCGA GGAGGCGGGC
AGCACCAAGG CCGCCCTCGA CCGGGCAGCC GCGGCGCTCA CCGGCTGCGG ATGCACCCCG
GCCCGGTCGA CGACCGTGGC GATCCCACCG GCAGCATCGC GCCCACCGGC AGCATCGCGC
CCACCGACCT GA
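
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any composition statistic such as GC content is computed. The sketch below shows one way to do that, using only the first row of the listing as sample input. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database rounds its reported GC content is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence listing, copied verbatim.
first_row = "GTGACGTCAG CCGCACCGGT CGCCCGGGTA GGGGGGCTCA CCGCCCTGGC CGGCGCGCTG"
row_seq = clean_sequence(first_row)

print(len(row_seq), round(100 * gc_fraction(row_seq)))
```

Passing the full listing to `clean_sequence` instead of the single row gives a value that can be compared with the GC content field.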
 
Protein sequence
MTSAAPVARV GGLTALAGAL LAGSLSVGPT TGSAAELAGS GSDRAGAAVL GGLDSAAPIP 
QPAAVAARLA GPLGDPVLAR PAAFVIDAQS GRVLLDVRSR IPVAPASTLK TAVATAALTT
FPDDRLRTTV VYSPPAGGNA RTSGGTLWLV GGGDPTLTAS TAPGGYPALA RLSDLAAQVR
AAGITSAARV IGDGGLFVGP DRAPGWRDNY VTDGDVAPVS ALEVDAGRSA PEAVGPRSPT
PAAAATTAFA AALMTAGVSV GSVATGPADP AAQRVAAVYS PPVRVLVERM LTQSDNDLAE
SLGRLVAHRR GLPASFTGAT RAVRDVLAEA GLPVAGMALA DVSGLSIANL IMPMTLVAIL
RAAVLPGRPA LRTILTGLPV AGFSGTLGDR YTTGDTAIGA GDVRAKTGTL HNVSSLAGQL
VDADGRLLLF AFLSPAEEAG STKAALDRAA AALTGCGCTP ARSTTVAIPP AASRPPAASR
PPT
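
The protein sequence corresponds to the gene sequence translated with translation table 11. As a hedged illustration, the sketch below translates the first 30 bases and the last 12 bases of the gene listing. It assumes the NCBI base ordering (T, C, A, G) for the codon table, treats an alternative initiator such as GTG as methionine only at the first codon, and uses a start-codon set that is an assumption about table 11. The function `translate` and its parameter `starts_at_initiator` are hypothetical names.

```python
# Standard codon assignments in NCBI base order (T, C, A, G); table 11 is
# assumed here to share them and to differ only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: initiator codons accepted under translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, starts_at_initiator: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        # An alternative start codon is read as Met only at the first position.
        if pos == 0 and starts_at_initiator and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence listing.
head = translate("GTGACGTCAG CCGCACCGGT CGCCCGGGTA")
tail = translate("CCACCGACCT GA", starts_at_initiator=False)
print(head, tail)
```

The two fragments can be compared with the first ten and last three residues of the protein listing.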