Gene Franean1_6483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6483 
Symbol 
ID5674798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7881303 
End bp7882808 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content79% 
IMG OID641245331 
Producthypothetical protein 
Protein accessionYP_001510726 
Protein GI158318218 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.159074 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAACGA TGTCCCGCAG ACGCTCCGCA CGCGGCCCGC GCCGCGGCCC GCCTCGCCCG 
GCGGGCCGTT CCGGGCCGGC GGGAGCCGCG GAACGCCACG AGTCACCGCC GGCGGCGGGT
CACAACGACC CGCCGCCGGC GCGGCACGAC CCGCCGCAGG CGCGGCTCCT GGTGGCGGCG
GTCTGCGTCC TGGCGATGGC GGCGCTGGGC CTCGGGCTCT TCTCCGATGT CCTGACCCGG
CACGTCCTGA CCCTCGGCCC TGGCCGGGCC GGTGGCCCGC CGCCCGCAGG CAGCGGCGGG
GGGCCACCCA CGCGGGGCGC CACGACGGAA GAGGCGGAAC AGCCCCCCTG GGGCGCTGCG
ACCGGTCAGC GGCCGGGGCC GGTGTGGCCA CCGCGGGCCG GCAGCCCGTC CGATGCCGGG
GACACGACCG AGCTCACCCT GACCTTCAGC GGCGATCTGG TCCTCGACCC GTCCAGCGCC
CGCGCGGCGC TGGCACCGCT CGGCAGCCTG TTGTCCTCCG CCGACCTCGC GATCTGCCGC
GGGCCGGCTC CCACCGCCCA CCCGGAGGTC ATCGCCGAGG CCCTGCGGCG GGTGGGCTTC
GGCGCGTGCG CCACCGCGTC CGGCCGGGCG GCCCGGCTGG GCGGTGCTGG GGTGCGCGGC
CTGCTGGACG CCCTCGACGG CGCGGCGATC GACCACAGCG GCACGGCCCG CGAGCCGCTG
GACGCCGCCA CCCTGTCGCT GCTGCCCGTG CGCGGCGCGC AGATCTCGCT GCTGTCCTAC
ACCGAGGACG CCGGCACCGA CCCCGCTCCC GGCTCACCCG GAGCGGACCC GCCCGGCTGG
ACGGTCAACG AGCTGGACCC GGCGCGGATC CTGCGGGACG CCGCCCGCGC CCGCCAGGCC
GGCGCCGACC TCGTCGTGGT CGCGCTGTCC TGGGCCCCCG ACCAGGCCGA ATCCGCGCGG
AGCGCGCCGA CGGGAACAGC GCCCACGGGG ACGGCTCCGA CGCAGCGGCA GCGGATGACC
GCGCGCGAGC TGCTCCGCTC CCCGCTCGTT GACCTGGTCG TGGGCACCAG CGCCGGCACG
GTGCGGCCGG TCGAACGCGT CGACGGCAAG TACGTCGCCT ACGGGACGGG CTCGATCACC
ATCCCGGCAG CGGGCGGCCT CAGCGGCGGT GCCGGCGGCG TCCCTGGCGG GGAAGCAGGC
GCCGAACCGG GCGTGGACGC CGCCGGGCGG GACCGGGAGC GGGACGGCGC GCTCCTGCAC
GCCCGGGTAC GGCGCACGGC GCTCGGCTGG ATGGTCGTCG GTCTCACCTA CAGCCCGATC
TGGACGGGGC CGGACGGCGT CGTCCGCCCG GTAGCGGACG CCCTCGACGA CCCGGGCACG
TCCGAGGCGG CACGGGCCGA GCTGACGGTG TCCTGGCTGC GCACCGTGGC CGCGCTGACC
TCGCTGGGGC AGGTCGACGG GGTCCGTCCG GAACGGGTGC CGCGCCAGCC CGGGGCCGGT
GCCTGA
 
Protein sequence
MGTMSRRRSA RGPRRGPPRP AGRSGPAGAA ERHESPPAAG HNDPPPARHD PPQARLLVAA 
VCVLAMAALG LGLFSDVLTR HVLTLGPGRA GGPPPAGSGG GPPTRGATTE EAEQPPWGAA
TGQRPGPVWP PRAGSPSDAG DTTELTLTFS GDLVLDPSSA RAALAPLGSL LSSADLAICR
GPAPTAHPEV IAEALRRVGF GACATASGRA ARLGGAGVRG LLDALDGAAI DHSGTAREPL
DAATLSLLPV RGAQISLLSY TEDAGTDPAP GSPGADPPGW TVNELDPARI LRDAARARQA
GADLVVVALS WAPDQAESAR SAPTGTAPTG TAPTQRQRMT ARELLRSPLV DLVVGTSAGT
VRPVERVDGK YVAYGTGSIT IPAAGGLSGG AGGVPGGEAG AEPGVDAAGR DRERDGALLH
ARVRRTALGW MVVGLTYSPI WTGPDGVVRP VADALDDPGT SEAARAELTV SWLRTVAALT
SLGQVDGVRP ERVPRQPGAG A