Gene Franean1_5902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5902 
Symbol 
ID5674223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7169379 
End bp7171256 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content74% 
IMG OID641244750 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_001510152 
Protein GI158317644 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGCG CTGGACCGCG CGCCTCGACA GGGCCCCGAC AGTGGCCTGA GTCGCCCCGC 
ACCGGCTCCG ACCCCGCAGC GGGCGGGCGG CGCCGGCCGA GGTCACGGCC GCACGCACGC
TCGCGCCAGC GCGAGGTCAC CCACTCGAAC CAGGACGGGT CGGAGACCGA GCTCACCGAG
CCGAGACCGA ACCAGCGCCG CGGCGGCTGG GACAGGCTCC AGCGCCGGAA CCCGGGCGAG
GCCGGCCCGG ACGCCCGAAC CCGGGAGCAG CACCTGGAGC AGAGTCAGGA CCACGCCCAG
GACGGCGTCC ACGACGACAC CACCGACCAC GACCACAGCA CCGACCACGG CCACGGCCAC
GGCCACGGCC ACGGGTCAGG GTGGCTCCCG GGGTCGGCCG GTCGGCGGCG TGGCGTGGCG
TCGCGGCTCG CCGTCGTCGT CTCCGCCATC CTGTCGTGCC TGATCTTCGC CTTCGCCGTC
GGCGGGTTCG CCGTCTACGA GCACTTCGAC CGCCAGATCA ACCGGCTGCG GCTGAGCCTG
GACGGCGACC GGCCCGCAAG CCCCGTCGAG GGGACGACCA ACTTCCTGCT CGTCGGCTCG
GACAGCCGGG CCGGCACCGG GGGCGAGTTC CAGCGCGGCG GCAAGGTCGC CGGCCAGCGT
TCGGACACCA CGATCCTCGC CCACCTCGAC GCGAACGGGA CGACGACCCT GGTGTCGTTC
CCGCGCGACA CCCTCGTGCG CATCCCCGGG CACGGCCGGG ACAAGCTGAC CCAGGCGATC
TCCATCGGGG GCCCGGGGCT GCTGGTCCGG ACCATCGAGA ACCTCACCGA CATCCGTGTC
GACCACTACG TGTCGGTCGA CCTCGCCGGG TTCCGCGAGA TGACCGACGC GATCGGCGGG
GTCACGGTCT GTGTGAAGGC GCTGCCGGAC GGGCGGCGGA CGAACCTGCG TGACGAGTGG
TCCCAGTGGC GGGGGCGGGT CGGCGAGAAC CACCTGACCG GCGACCAGGC CCTCGCGTTC
GTCCGCCAGC GCCACGGCCT GCCCGACAAC GACTTCGACC GCATCCGCCG GCAGCAGCAG
TTCATCGGGG CGGTCTTCCG CAAGGCCACC AGCGACGGCG TGCTGACGAG CCCGGCCCGG
CTGGAGAACC TGATCAGCGC GGTGACGCGG GCGCTGACCA TCGACGACGG AACGGACATC
GAGGATCTCC GGCTGCTCGC GAAGCGGATG GGGTCGATGA GCTCCGACCA GATCAGGTTC
GTGACGATCC CCGTGCACGC GCCGTCGCCG GCCGAGGGCG GGAACGCGCT CGGCGAGCTG
CCCCGGTTCG GTTCCGTGCA GCTTTACGAC CAGGCGCAGC TCGACGCGTT CCTGGCGCCG
CTGCGCGGCC GGGACGGCAC CAGCCCGACG GCCGTCCCCG CCCCGCCGGC GTCGCCCCCG
GGCGAGGTTT CCGTCGACGT GTTCAACGCC GCGCGGGTCG GGGGGCTCGC AGCGGCCGTG
CGCAGTGACC TCGCCAGTCT CGGGTTCCGC GTCGGAACCC CGCGGGACTG GCCCGCCGGC
TCGCTGCAGA CCAGCGAGGT GCGGTACGGG CCCGGCGGCG AGGCGGCGGC GCGCGCCGTG
CGGGCCGTCG TGCCCGACGC CAGGCTTGTC CGCGACGACG ACCTGGCCGA CCGGATCTCC
CTGGTGCTGG GCGAGTCGTT CGAGAAGGTG GACGCGACCG GCGTCCCCGC GGCCGGGGCC
CGAGCGGTCT CCGGTATGCG CCCGTCGGCA CCGGGCAGCG CCTCGACGGG GTCCGCGGGC
CCGGCCCTGT CCGGAGCCCC CACCAGGCCG ACCGCCCCGG TGACGGCGAC CGAGCTGACC
ACCGGCTGCA CGTACTGA
 
Protein sequence
MTGAGPRAST GPRQWPESPR TGSDPAAGGR RRPRSRPHAR SRQREVTHSN QDGSETELTE 
PRPNQRRGGW DRLQRRNPGE AGPDARTREQ HLEQSQDHAQ DGVHDDTTDH DHSTDHGHGH
GHGHGSGWLP GSAGRRRGVA SRLAVVVSAI LSCLIFAFAV GGFAVYEHFD RQINRLRLSL
DGDRPASPVE GTTNFLLVGS DSRAGTGGEF QRGGKVAGQR SDTTILAHLD ANGTTTLVSF
PRDTLVRIPG HGRDKLTQAI SIGGPGLLVR TIENLTDIRV DHYVSVDLAG FREMTDAIGG
VTVCVKALPD GRRTNLRDEW SQWRGRVGEN HLTGDQALAF VRQRHGLPDN DFDRIRRQQQ
FIGAVFRKAT SDGVLTSPAR LENLISAVTR ALTIDDGTDI EDLRLLAKRM GSMSSDQIRF
VTIPVHAPSP AEGGNALGEL PRFGSVQLYD QAQLDAFLAP LRGRDGTSPT AVPAPPASPP
GEVSVDVFNA ARVGGLAAAV RSDLASLGFR VGTPRDWPAG SLQTSEVRYG PGGEAAARAV
RAVVPDARLV RDDDLADRIS LVLGESFEKV DATGVPAAGA RAVSGMRPSA PGSASTGSAG
PALSGAPTRP TAPVTATELT TGCTY