Gene Francci3_4055 details


Gene Information

Locus tag: Francci3_4055
Symbol:
ID: 3907016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4847427
End bp: 4849091
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 74%
IMG OID: 637881384
Product: pyridoxal-dependent decarboxylase
Protein accession: YP_483134
Protein GI: 86742734
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0076] Glutamate decarboxylase and related PLP-dependent proteins
TIGRFAM ID:
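
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but the numbers agree under that assumption) and that the unspliced CDS ends with a single stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4847427
end_bp = 4849091
gene_length_bp = 1665
protein_length_aa = 554

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # 1665 555 554
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.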


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.505018
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGATC TCGCGTTGGC CGCCCTCGAC GATGGCCGCC GGATCCGCGG CGGCCCGCTG 
CCCGCCGGTG GTCCACCGGT GGTGCGCGCC ATGACCAGAG CTGCCCTCGG CCTGACTCCG
GCCGATCCGG CCCCGGCCGA TCCGGCCCCC CCGGACAGTA CAGGGACCCT GCTACCGAGG
ACCGGAATCG GCCCGTCGGC CGCGCTGACC GGCCTGGTCG AAGCCCTGGC CGCCGGTTCC
GCGGACCCGG CGGACCCGTG GTGCGCCGCC CATTTGCACT GCCCTCCGCT GGCGGTCGCG
GTCGCCGCCG ACCTCGCCGT GAGCGCCCTC AATCCGTCCC TGGACTCCTG GGACCAGGCT
CCCGCGGCCA CGACGATCGA GACCGAGGTC GTCGCGACCC TGGCCGCACT GGTCGGCTTC
GATCCGGACC GGGCCACCGG CACGATCACC ACCGGTGGGA CCGAATCGAA CCTGATGGGC
CTGTTCCTCG GTCGGGACGC GGCCCGCCGT GGCCCGACCG GGCAAGACGC TTTGGGCCCG
GCCGCCCCAG GTCACGATGC CGGGCACCAA GGAGGCCCCG GGCACCGAGG AGCGGACCGG
CCGGCACCTC GGGTGTTCCG TTCGGCGGCG GCACACTTCT CGATTGATCG GGCGGCGTCC
CTGCTCGCGA TCGACGCGCC TCCGGTGCTG GTCATCGAGA CCGACGACCG GCACCGGATG
CGCGTCGAGG CGCTCCGTCA CGCCCTGGCC GCGCACGCCG GGCCGGCCAT CGTCGTGGCC
ACCGCTGGTA CCACCGATGC GGGCGCGGTC GACCCGCTTC GGCAGATCGC CGCGGTGGTC
GAGGAACATC GCCTCCGGTT AGGTGGGTCG CAGGACCCGA GCAGGCCATC GGGCATGGAG
GCAAGGTCCG GCAGGGCGGC CTGCTGGTTT CACGTGGACG CCGCCCACGG CGGCGGCGCG
TTGCTGTCCG AGCGGCTGGC CGGCCTGCTG GACGGACTCG ACCTCGCAGA TTCGGTCGCA
CTGGATCTGC ACAAACTCGG CTGGCAGCCG GCGCCGGCCG GGGTCTTCCT CACCGCCCGC
GACGACGGCT GGGCCAGCCT CGCGACACGG GCGGAGTACC TCAACCCGGA GGATGACGAG
GCGGCCGGCT TCACCAGCCT GCTGGGCCGG TCGCTGCGGA CCACCCGCCG CGCGGACGCC
TTCCCCATCG CGGTCACCCT TCGGGCCCTC GGCCGCGACG GTCTCGGCGC CCGGGTGGAT
GCCTGCCATG ATCTCGCGCA GCACGCGGCA CGGACGGTGC GCGCCGACCC CCGTCTCGAA
CTCGCCTTCC CGGTAACACT CAGCACCCTC GTCTTCCGCT ACCGGCCGCC ACAGGCCATC
TCGGATCCGA CCGCGTCCGG CCCCACCGCA TCCGGCCCCG CGGGCCCGCC CTCCCCCGAC
CGGGTGGACC GGATCAACGC CCGGCTGCGC CGGGAACTGC TCGTCGCCGG CCGGGCCGTC
GTGGGGCGCA CCCAACTCGG TCCCCGTCAC GCGGTCTTCC TGAAGCTGAC TCTGCTCAAT
CCCGACGCCA CCACGGCGGA CGTCGACGCG CTGCTCGGGC TGATCGCCGA GACCGGGGAC
GTCCTTGCCG CCGGTGGTTC CACCAGGGCC CGGCGGCGGT CGTGA
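
The gene sequence is printed in groups of 10 bases, 60 per line, so the spacing has to be removed before the sequence is measured or compared with the GC content field. A minimal sketch of that cleanup follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a sample.

```python
def clean_sequence(text: str) -> str:
    """Drop the whitespace used for 10-base grouping and line wrapping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as a short sample.
first_line = "ATGGTCGATC TCGCGTTGGC CGCCCTCGAC GATGGCCGCC GGATCCGCGG CGGCCCGCTG"
sample = clean_sequence(first_line)
print(len(sample), round(gc_fraction(sample), 2))
```

Passing the full sequence block through the same two functions gives a value to compare against the gene length and GC content listed in the record.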
 
Protein sequence
MVDLALAALD DGRRIRGGPL PAGGPPVVRA MTRAALGLTP ADPAPADPAP PDSTGTLLPR 
TGIGPSAALT GLVEALAAGS ADPADPWCAA HLHCPPLAVA VAADLAVSAL NPSLDSWDQA
PAATTIETEV VATLAALVGF DPDRATGTIT TGGTESNLMG LFLGRDAARR GPTGQDALGP
AAPGHDAGHQ GGPGHRGADR PAPRVFRSAA AHFSIDRAAS LLAIDAPPVL VIETDDRHRM
RVEALRHALA AHAGPAIVVA TAGTTDAGAV DPLRQIAAVV EEHRLRLGGS QDPSRPSGME
ARSGRAACWF HVDAAHGGGA LLSERLAGLL DGLDLADSVA LDLHKLGWQP APAGVFLTAR
DDGWASLATR AEYLNPEDDE AAGFTSLLGR SLRTTRRADA FPIAVTLRAL GRDGLGARVD
ACHDLAQHAA RTVRADPRLE LAFPVTLSTL VFRYRPPQAI SDPTASGPTA SGPAGPPSPD
RVDRINARLR RELLVAGRAV VGRTQLGPRH AVFLKLTLLN PDATTADVDA LLGLIAETGD
VLAAGGSTRA RRRS
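
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below builds the codon table by hand instead of using a bioinformatics library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. Alternative start codons are not handled, which is acceptable here because this gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino acid assignments
# (shared by translation table 11). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text: str) -> str:
    """Translate a DNA string (spaces allowed) up to the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGGTCGATC TCGCGTTGGC CGCCCTCGAC"))  # MVDLALAALD
print(translate("CGGCGGCGGT CGTGA"))                  # RRRS
```

The two sample outputs match the first ten and last four residues of the protein sequence listed above.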