Gene Franean1_1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1248 
Symbol 
ID5669661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1503306 
End bp1504844 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content75% 
IMG OID641240180 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_001505608 
Protein GI158313100 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.375848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0192244 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGGGAC GTCCGGCGCA GGGCCGCCCG GCGCGGGACC GACCGGCCGC CGGCGAGGAG 
AACTGGCCGG CGGGAGCCTG GCCGCGCCAC GAGCCCCGCT CGGCTCCCGC GCCGATCCCG
CCCACCCGCC GGCTGCCTCC GCCGGGCGCC ACCGACGGCG GGCGGACCTG GCCCGCCCCG
AACGACGGGC CCGGCGGGTA CCGCGCGCCG GCCGGCCCGG TGCCCGGGTA CGGCGGCCCG
CGCGGGCCGT ACGACACCCC GGGCGAGGAC CTGCCCGCGG AGGAGCCACA CCGCCCGGTG
AGCGGTGTGC GCCGGACGGT GACCCTGGTG GCCGCCATCG TCTCGGTGGC CGTCCTGGTC
GTCGCCACGA GCGGCTGGGC CGTGCTGCGC CACTACGACG GCAAGGTGAA CCACATCGAG
CTCACGTTCT CCGACTCGGC CGCGCGGCCC TCCGCCGCCG GCGGAGGCAC CCAGAACATC
CTGCTGGTGG GCTCGGACAC CCGCGCGGGC ACCGGGGGCG AGTTCGGCCA GACCGAGGGG
CAGCGCTCGG ACACCACGAT CCTCGCCCAC CTCGACGCCG ACGGCTCGAC GACCCTGGTG
TCCTTCCCCC GCGACCTGTG GGTGCAGATC CCGGGCTACA CCGGCTCCGA CGGCACCCAG
CACGACGCGC AGAAGTCCAA GCTCAACGCG GCGTTCGCCT ACGGCGGGCC GTCCCTGCTG
GTCCGGACCA TCGAGACGCT CACCAACATC CGGATCGACC ACTACCTCGA GATCGACTTC
CTGGGCTTCC AGGCGATGAC GGACGCGCTC GGCGGCGTCA CCGTCTGCGT GAAGGAGCTG
ACGCCCGAGC TCAAGGCGCA GGGCTTCGAC AACCTCAACG ACCGGTACTC CGGCTGGCAC
GGCCAGGTCG GCAACAACAC GCTCACCGGT GAGCAGGCGC TGGCCTTCGT CCGGCAGCGT
TACGGCCTGC CCGGCAGCGA CCTCGACCGC ATCCACCGCC AGCAGCAGTT CCTCGGCGCG
GTGTTCCGCG AGGTCGCCTC CACCGGGACC CTGCTCAACC CGCGCAAGCT GCTCGACGTC
GTGGACGCGG CCACCTCCGC GCTGACCCTG GACGACCACA CCTCGCTCAC TGATCTGCGC
CTGCTCGCCG TCCGGATGCA GGGCATCAGC ACCGGTGGGG TCACCTTCGC GACCGTCCCG
GCCACGCCGT CACAGGCCGG CGGGCAGTCC GTGCTCCTGG CGAAGACCGA CGAGCTGACG
ACGCTGCTCG CCGGGATCGG CGGCTCCCCG CCGCAGGCCG CCGGGCCCCC CGCGCTCGGC
CCCGCCGGCT CGCCGTCCGG CTTGACGGCG GCCTCGGCGG CCTCGGCGGC CTCCGTGGGT
GCTACGGCCG CGTCCGGTGC CGTGCCGGCG TCCGGTACCG GCCACGGTTC GGTGGTCACC
GCGGACCTGC GCGCCACCGG AGGGCGGCCC GCGGGCGGCG CGGTGACCCT GGCCCAGGCC
ACCCCCGAGC CGTCCGGCGG GGTGGGCTGC ACCTACTGA
 
Protein sequence
MQGRPAQGRP ARDRPAAGEE NWPAGAWPRH EPRSAPAPIP PTRRLPPPGA TDGGRTWPAP 
NDGPGGYRAP AGPVPGYGGP RGPYDTPGED LPAEEPHRPV SGVRRTVTLV AAIVSVAVLV
VATSGWAVLR HYDGKVNHIE LTFSDSAARP SAAGGGTQNI LLVGSDTRAG TGGEFGQTEG
QRSDTTILAH LDADGSTTLV SFPRDLWVQI PGYTGSDGTQ HDAQKSKLNA AFAYGGPSLL
VRTIETLTNI RIDHYLEIDF LGFQAMTDAL GGVTVCVKEL TPELKAQGFD NLNDRYSGWH
GQVGNNTLTG EQALAFVRQR YGLPGSDLDR IHRQQQFLGA VFREVASTGT LLNPRKLLDV
VDAATSALTL DDHTSLTDLR LLAVRMQGIS TGGVTFATVP ATPSQAGGQS VLLAKTDELT
TLLAGIGGSP PQAAGPPALG PAGSPSGLTA ASAASAASVG ATAASGAVPA SGTGHGSVVT
ADLRATGGRP AGGAVTLAQA TPEPSGGVGC TY