Gene Franean1_2523 details

Gene Information

Locus tag: Franean1_2523
Symbol:
ID: 5675695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3002768
End bp: 3003874
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 66%
IMG OID: 641241439
Product: poly-gamma-glutamate synthesis protein
Protein accession: YP_001506860
Protein GI: 158314352
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)
TIGRFAM ID:
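
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3002768, 3003874)
print(length)                  # 1107
print(protein_length(length))  # 368
```

Both values agree with the Gene Length and Protein Length fields of this record.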


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTGCGAGG GTGCGGTGAG GTTGTTCCTC TGCGGCGACG TGATGCTCGG CCGCGGTATC 
GACCAGATCC TGCCGCATCC CGGTGACCCG ACACTTTCCG AGAGATACGT CTGGGATGCC
CGCAGCTACG TCGAGTTGGC AGAGGCGGTG AACGGCCCGA TCCCTCACCC GGTCGACTTC
GCCTGGCCCT GGGGAGATGT GCTGCCAGCG CTGGACGAGG CTGCGCCCGA TGTCCGGGTG
CTGAATCTGG AAACCACCAT CACCCGGTGC GATGCCTTCG CGACGGGGAA GGAAGTCCAC
TATCGGATGA GCCCGGACAA CCTGCCCTGC CTGACTGCCG CTCGACCTGA CGTGTGTGTG
CTGGCAAACA ACCATTTGCT TGATTTCGGT CACCGGGGGC TCATCGAGAC GCTCGATGTG
CTGTCCGGTG CCGGGCTGAT GGGGGCGGGA GCCGGACACG ACGCAGAAGA GGCCCGCCGG
CCCGCGGTTG TGCCGATCGA TGGCAGCCGG CGGGTCCTGA TCTTCTCGAT CGGGCTGCCG
TCCAGCGGCA TTCCAGCGAC GTGGGCCGCG ACCGAGGGCA GGGCCGGCGT TGACCTCGTC
CCGGAGCTGT CGGATGGCTG GACCGACGAG GTCGCGGGCC GTGTCCGGCA GGTGAAGCGG
CCCGGTGACC TCGCCGTCGC GTCCATCCAC TGGGGCTCCA ACTGGGGCTA CGACGTCGAT
GACGACCAGA TCCGCTTCGC GCATCGGCTG GTGGACGGCG GTATCGACAT CGTGCACGGA
CACTCGTCGC ACCATCCACG GCCTGTCGAG ATATATCGGG ACAGGCTCAT CCTGCACGGG
TGCGGCGATT TCATCGACGA CTATGAGGGG ATCGCCGGCT ACGAAAGCTA TCGGGACGAC
CTGCGGCTGG CGTACTTCGT GTCGGTCGAT CCGGGCAGCG GGAATCTGAT CGACCTGCGT
GTGATGCCCC TGCAGGCACG GCAGATGCGA CTTCGGTACG CCGGCTCCGA AGACTCCGCC
TGGCTACAGG AGATTCTTGA CAGCATCAGC CGCGGCTTCG GGACACGGTT CGGCCTTCAG
GCGGACGGCA TGCTCTCGCT CCGCTGA
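
A GC content figure like the one reported in the gene information can be recomputed from the gene sequence. The following is a minimal sketch; the helper name `gc_content` is an assumption, and it simply ignores the whitespace that separates the 10-base blocks.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to lay out the sequence.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence; paste the full sequence to check the record.
first_line = (
    "GTGTGCGAGG GTGCGGTGAG GTTGTTCCTC TGCGGCGACG TGATGCTCGG CCGCGGTATC"
)
print(f"{gc_content(first_line):.1f}%")
```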
 
Protein sequence
MCEGAVRLFL CGDVMLGRGI DQILPHPGDP TLSERYVWDA RSYVELAEAV NGPIPHPVDF 
AWPWGDVLPA LDEAAPDVRV LNLETTITRC DAFATGKEVH YRMSPDNLPC LTAARPDVCV
LANNHLLDFG HRGLIETLDV LSGAGLMGAG AGHDAEEARR PAVVPIDGSR RVLIFSIGLP
SSGIPATWAA TEGRAGVDLV PELSDGWTDE VAGRVRQVKR PGDLAVASIH WGSNWGYDVD
DDQIRFAHRL VDGGIDIVHG HSSHHPRPVE IYRDRLILHG CGDFIDDYEG IAGYESYRDD
LRLAYFVSVD PGSGNLIDLR VMPLQARQMR LRYAGSEDSA WLQEILDSIS RGFGTRFGLQ
ADGMLSLR
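
The protein sequence can be reproduced from the gene sequence with translation table 11, as named in the gene information. The sketch below builds the codon table from the standard TCAG ordering and treats the table 11 alternative start codons as methionine when they appear in the first position, which is why the leading GTG yields M. The start codon set is taken from the NCBI definition of table 11 as best recalled, and the function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons for table 11 (assumption: per the NCBI table definition).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        # An alternative start codon in first position is read as Met.
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGTGCGAGG GTGCGGTGAG GTTGTTCCTC"))  # MCEGAVRLFL
```

The output matches the first ten residues of the protein sequence above.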