Gene Francci3_4053 details


Gene Information

Locus tag: Francci3_4053
Symbol:
ID: 3907014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4843540
End bp: 4844994
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 70%
IMG OID: 637881382
Product: diaminopimelate decarboxylase
Protein accession: YP_483132
Protein GI: 86742732
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0019] Diaminopimelate decarboxylase
TIGRFAM ID:
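
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the listed numbers are consistent with that) and that the CDS ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4843540, 4844994)
print(length, protein_length_aa(length))  # prints: 1455 484
```

Both values agree with the Gene Length and Protein Length fields of this record.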


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACCTGC CGCCCTCGGC CGCGCCGCTG CGCGCCCTGC CCGCCGAAAG CCTCCCCCGT 
CCGGTGTTGA CCGCGGTGCG TGCCTGGCCG GCCGGCGCCG GAGAGGTCAG CGCCTACTTT
TACGACCCGG CGGCGGCCGC GCGGAACGCC GCGGCGCTGC GGGCCTGCCT GCCCATGTGG
GCGGAGGTCT GCTACGCGGT CAAGGCCAAC AGCTTTCCGC CGATACTCGA CGCCCTGTTG
CGTGGGACCG GTCCGGACAT CCCTCGGTCG GGGCGGGAAC GGAGTGACTC CAGCCCGGGC
GCCAGCCCGA CCGACGCCAG GAGCGACGCC AGGAGCGACG CCAGGAGCGA CGCCAGGAGC
GACGCCAGGA CCGGCGCCAG CGGTCGGGGC ATCGACGGTT TCGAGGTCTC CTCCGCGCAC
GAGGTGCGCC TGGCGCAGGC AGCGTACGGC CGGGTCGACT CCGCCCGACG TCCCCGCCTG
GTCGCATCCG GACCGGGAAA GTCGATGACC CTGCTTACCG CGCTGGTCTC GGCCGACGTC
GACGTCGTCA ACGTCGAGAG CTTCGGCGAA CTGCGCCGCC TGGACGCCGT CGCGGCCCGG
CTCGGCCGTC GGGTGCGGGC GGCGGTGCGC GTCAACCCGG CCCAGGTAGG CCTGGCAGGC
TCCCTCCAAA TGGGCGGTCG GCCGAGTGCG TTTGGCATCC CCGAAACGGA GGTCCCCGCC
GCGCTCGAAC TCGCGGCGGA TCTGGCCCAT ATCGACGTCG TCGGATTCCA TGTCCACGCA
GTGTCCGGCA ATCTCGATGC CGCCGCCCAT CTGGGCTACG TCCGGTGGTG TCTCGATTTC
GCCGTCCGGA CGGCTGCCGT GGCCGGGATC GAGCTGCGGA TGGTCGACGT GGGCGGTGGA
TTCGGGGTTC CGTTCGAACC GGACCGGCGC GGTGACCGGC CCTTTGACCT AATCCGATTT
GGTGAGGGAC TAAGTTCGCT GCCGGTGCCG GACGGAACTC GGGTAATCTT CGAGCCGGGG
CGGATGCTGG TGGCCGACTC CGGCTGGTAC GCCGCCGAGG TGATCGACGT CAAACATTCC
TACGGGACGG AATTCGTGAT TCTGCGTGGC GGAATTCATC ATTTCGCGCT GCCGACATCG
TGGGAGATCG TCCATAACTT CGCCGTCGTG GAGAGATCGG GCCGGAACGG TGGGCCGGAG
CCGGAGCACA CGCGGGAGGC GGTGGTCGAA GGTCGTCCGG TGACGGTGGT CGGGGAGCTG
TGCACGCCGG AGGACACCCT CGCCCGTGAC ATCACGGTCG ATCGGGTGCG GCCGGGTGAC
GTCGTCGTCT TCCCCATGGC GGGTGCGTAC GGCTACGAGT TCGCACTGCC GGATTTCCTC
GGCCACCCCC GGGCCCGGCG CATCGAGCTT CGGGGAACAC GGCATGCTCC GTTCCCCACC
GATCGGTCGA GCTGA
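
The GC content field of this record can in principle be recomputed from the gene sequence above. The sketch below is one plausible way to do that in Python, assuming GC content means the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper name. It is demonstrated on the first 10-base block only, so the printed value describes that fragment and not the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence listed above.
first_block = "GTGGACCTGC"
print(f"{gc_fraction(first_block):.0%}")
```

To check the full record, pass the complete gene sequence (spaces and line breaks are ignored) instead of the single block.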
 
Protein sequence
MDLPPSAAPL RALPAESLPR PVLTAVRAWP AGAGEVSAYF YDPAAAARNA AALRACLPMW 
AEVCYAVKAN SFPPILDALL RGTGPDIPRS GRERSDSSPG ASPTDARSDA RSDARSDARS
DARTGASGRG IDGFEVSSAH EVRLAQAAYG RVDSARRPRL VASGPGKSMT LLTALVSADV
DVVNVESFGE LRRLDAVAAR LGRRVRAAVR VNPAQVGLAG SLQMGGRPSA FGIPETEVPA
ALELAADLAH IDVVGFHVHA VSGNLDAAAH LGYVRWCLDF AVRTAAVAGI ELRMVDVGGG
FGVPFEPDRR GDRPFDLIRF GEGLSSLPVP DGTRVIFEPG RMLVADSGWY AAEVIDVKHS
YGTEFVILRG GIHHFALPTS WEIVHNFAVV ERSGRNGGPE PEHTREAVVE GRPVTVVGEL
CTPEDTLARD ITVDRVRPGD VVVFPMAGAY GYEFALPDFL GHPRARRIEL RGTRHAPFPT
DRSS
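
The protein sequence starts with M although the gene sequence starts with GTG. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons and is read as methionine when it initiates translation. The following Python sketch illustrates this under stated assumptions: the amino acid string is the standard NCBI ordering for bases T, C, A, G, the start codon set is the one defined for table 11, and `translate_cds` is a hypothetical helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS (or in-frame fragment), stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            # An initiating start codon is read as methionine.
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGGACCTGC CGCCCTCGGC CGCGCCGCTG"))  # prints: MDLPPSAAPL
```

The result matches the first ten residues of the protein sequence above. Applying the same function to the last line of the gene sequence (GATCGGTCGA GCTGA) gives DRSS, the final four residues, with TGA as the stop codon.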