Gene Francci3_0181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0181 
Symbol 
ID3903150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp212677 
End bp214077 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content70% 
IMG OID637877513 
Productglutamate decarboxylase 
Protein accessionYP_479302 
Protein GI86738902 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0076] Glutamate decarboxylase and related PLP-dependent proteins 
TIGRFAM ID[TIGR01788] glutamate decarboxylase
[TIGR03382] Myxococcales GC_trans_RRR domain 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.305045 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCTAC ATCGTCAGGC GAGTGGTGAT CGGGCGGAGG AGACGGTCGA CGTACGGCCG 
CACCTGGCCC CGCTGGGCGA GGAGGTCGTA GTCCCGCGGT TCCGGATGCC GCAGGAGTCG
ACTGCGCCGG AGACGGCCTA CCAGATCGTC CACGACGAGC TGATGCTCGA CGGCAAGGCG
CGGCTGAACA TGGCCACGTT CGTGACCACC TGGATGGACT CGTACGCCGA CCGGCTGATG
GCCGAGTGCG CCCCGAAGAA CATGATCGAC AAGGATGAGT ATCCGCAGAC GGCGGCGCTG
GAGGAACGCT GCGTCAACAT CCTCGCCGAC CTGTGGCACG CCCCCGACGC CGAGCACGCG
GTCGGCTGCT CGACCACGGG GTCGTCCGAG GCGTGCATGC TCGCCGGGCT TGCCATGATC
CGCCGCTGGC GGGCGCGTCG CCGGTCCGCC GGCGCCCCGG CCGACCGGCC GAACATCGTC
ATGGGTGTCA ACGTGCAGGT GTGCTGGGAG AAGTTCGCCC GGTACTGGGA CGTGGAGGCA
CGGCTGGTGC CGATGGCGCC GGGGCGCACG CATCTGACCG CCGACGAGGC CGTGGCGCAC
TGCGACGAGA ACACCATCGG CGTGGTGGCG ATCCTCGGCT CGACGTTCGA CGGCAGCTAC
GAGCCGGTCG CCGGGATCGT GGCCGCGCTG GACCACCTCG CCGCCTCCGG AGGGCCCGAT
GTTCCGGTAC ACGTGGATGC CGCGTCCGGC GGGTTCATCG CCCCGTTCTG CGACCCCGAT
CTGGTCTGGG ACTTTCAGCT CGACCGGGTG GTGTCCATCA ACACCTCCGG GCACAAGTAC
GGCCTGGTCT ATCCGGGGGT GGGCTGGGTG CTGTGGCGCG ACCGGGCGCA TCTACCGGCC
GAACTCGTCT TCCAGGTGGA CTATCTGGGC GGGACGATGC CGACCTTCGC GCTGAACTTC
TCCCGGCCGG GCGCACAGGT CGTCGCGCAG TACTACACCC TGCTGCAACT GGGCTATAAG
GGCTACCGCC GGGTGGCCCA GGCGTGTCGG GACAACGCCC GCTGGCTGGC CGCCGAGGTG
GCGGCGATGG GGCCGTTCGA ACTGGTCTCC GACGGTAGCG GCATTCCGGC GTTCGCGTTC
AAGCTCCGTG ACGACATCAC GGACTACACC GTCTTCGACG TCTCCGAGCT GCTGCGCACC
CGTGGCTGGC TGGTGCCCGC CTACCGGTTC CCGCCGGGGC TGACCGATCT TGCGGTGTTG
CGGGTCGTCG TGCGTCACGA GTTCAGCCGG GACCTCGCCG GGCTGCTCAT CGCCGACCTG
CGCCGGGTGG TGAACCGGCT GGCCCATCCG GGCCGGCGCA CCGCGGGCGA CCGGCCCGGC
GAGTCATCGT TCCACCACTG A
 
Protein sequence
MALHRQASGD RAEETVDVRP HLAPLGEEVV VPRFRMPQES TAPETAYQIV HDELMLDGKA 
RLNMATFVTT WMDSYADRLM AECAPKNMID KDEYPQTAAL EERCVNILAD LWHAPDAEHA
VGCSTTGSSE ACMLAGLAMI RRWRARRRSA GAPADRPNIV MGVNVQVCWE KFARYWDVEA
RLVPMAPGRT HLTADEAVAH CDENTIGVVA ILGSTFDGSY EPVAGIVAAL DHLAASGGPD
VPVHVDAASG GFIAPFCDPD LVWDFQLDRV VSINTSGHKY GLVYPGVGWV LWRDRAHLPA
ELVFQVDYLG GTMPTFALNF SRPGAQVVAQ YYTLLQLGYK GYRRVAQACR DNARWLAAEV
AAMGPFELVS DGSGIPAFAF KLRDDITDYT VFDVSELLRT RGWLVPAYRF PPGLTDLAVL
RVVVRHEFSR DLAGLLIADL RRVVNRLAHP GRRTAGDRPG ESSFHH