Gene Francci3_0279 details

Gene Information

Locus tag: Francci3_0279
Symbol:
ID: 3903021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 323798
End bp: 325183
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 71%
IMG OID: 637877607
Product: pyridoxal-dependent decarboxylase
Protein accession: YP_479395
Protein GI: 86738995
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0076] Glutamate decarboxylase and related PLP-dependent proteins
TIGRFAM ID:
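
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below assumes the coordinates are inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def cds_span_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, not counting the stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the final codon is the stop codon


# Values copied from the record above.
span = cds_span_bp(323798, 325183)
print(span)                              # 1386, matches Gene Length
print(expected_protein_length_aa(span))  # 461, matches Protein Length
```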


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCATCGGT TTGACGCGCA GGCTGCCGAC CTCGTCCGCG CGATATGTGA CTTTGCCCGC 
GTCCGGCTCG GATTCGATCC GGTACCGCTG GACGCGCCAT TGTCATGGGA CGAGTTGGCG
GCGGCGGTGG GATCCACGAT CACCGCCGAG GGCATCGGTG GCCGTCGCGC TCTCGAGGTG
TTCGACGAGG AGCTGTCCCG CGCCTGTATC TCGACGGATC ATCCCCGCAA TCTCGCCTTC
ATCCCGGCCG CTCCGACTAA GGCGGCTGTA TTGTTCGATC TGGTGGTCGG GGCGTCGTCC
ATCTACGCGG GCAGCTGGAT GGAGGGCGCG GGCGCGGTCT TCGCGGAGAA CGAGGCGCTG
CGATGGCTGT CGGACCTCGC GGGCTTCCCC GCCTGCGCCG GTGGGCTGTT CGTACCGGGG
GGCACCGTCG GCAACCTGTC GGCGCTGGCC GCCGCCCGGC ATGCCGCCCG GAGCCGGTTG
ACCGCCGCCG GTCGGCCGAC TCCACCGCGA TGGCGGTTCG TCTGTGGGGC CGAAGCGCAC
TCCTCCCTCT ACCAGGCCGC CACGGTGCTC GACACCGAGG TCGTCGTCGT GCCGACGGAT
GACGCCGGAC GACTGACCGG TCCGCTGTTG GCCGAGGCGC TGGACCGGCT CGCCGAGCAG
GACGGCGCCC AGGCCGTCGA CGGCGTGTTC GCGGTGGTGG CGACCGCAGG GACCACCCAG
TTCGGCACCG TCGATGACAT CCGCGGGGTG GTGGACGTCT GTCAGGCCCG CGGGCTGTGG
GTACATGTGG ACGGCGCCTA CGGGCTGGCC GCGCTCGCCG CCGCATCGAC CCACTCTCTC
TTCGACGGGA TCGCCGAGAC CGACTCGTTC ATCGTCGATC CGCACAAGTG GCTGTTCGCG
CCGTTCGATG CCTGCGCGCT GGTGTATCGC GATCCGGCGG TGGCCCGGGC GGCGCACGGC
CCGCAGCGGG CCGGCTACCT CGAGGTCCTG GATTCGGCGG GGGCCTGGAA CCCGTCGGAC
TACGCCATCG GGCTGTCCCG GCGGGCCCGC GGGCTGCCGT TCTGGTTCTC GCTGGCGACG
CATGGCACCT TGGCCTACGG CCGGGCCATC GAGTCCACGC TGGCGACCGC CCGGGCGGCC
GCGCTCCAGA TCGCCGCGCT GCCTTACGTC GAGCTGGTGC GGGAACCGCA GCTGTCGATC
GTGGTGTTCC GTCGGCTGGG TTGGCAGGCC GCGGACTACC AGCGGTGGAG CGAGAACCTG
CTGCGGGACG GTTTCGCGTT TGTTCCGCCC ACCGTGCACG AGGGCGAGAC CGTCGCCCGG
TTCGCCATCG TCAACCCGCG GACCACCGTT GACGACATCG GCGCGATCCT CGCCACGATG
GCCTGA
 
Protein sequence
MHRFDAQAAD LVRAICDFAR VRLGFDPVPL DAPLSWDELA AAVGSTITAE GIGGRRALEV 
FDEELSRACI STDHPRNLAF IPAAPTKAAV LFDLVVGASS IYAGSWMEGA GAVFAENEAL
RWLSDLAGFP ACAGGLFVPG GTVGNLSALA AARHAARSRL TAAGRPTPPR WRFVCGAEAH
SSLYQAATVL DTEVVVVPTD DAGRLTGPLL AEALDRLAEQ DGAQAVDGVF AVVATAGTTQ
FGTVDDIRGV VDVCQARGLW VHVDGAYGLA ALAAASTHSL FDGIAETDSF IVDPHKWLFA
PFDACALVYR DPAVARAAHG PQRAGYLEVL DSAGAWNPSD YAIGLSRRAR GLPFWFSLAT
HGTLAYGRAI ESTLATARAA ALQIAALPYV ELVREPQLSI VVFRRLGWQA ADYQRWSENL
LRDGFAFVPP TVHEGETVAR FAIVNPRTTV DDIGAILATM A
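
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG is one of the permitted initiation codons and an initiation codon is read as methionine. The following is a minimal sketch of how the block-formatted sequence could be cleaned, its GC fraction computed, and the CDS translated. It assumes the protein sequence is a direct translation of the gene sequence (the record does not say how it was derived), and all names in it are hypothetical.

```python
# NCBI translation table 11, written in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Initiation codons listed for table 11; at the first position they encode M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCATCGGT TTGACGCGCA GGCTGCCGAC"))  # MHRFDAQAAD
```

Passing the full gene sequence to `gc_fraction` and `translate_cds` would allow the GC content and protein sequence fields to be compared against the values listed in this record.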