Gene Franean1_5205 details


Gene Information

Locus tag: Franean1_5205
Symbol:
ID: 5673539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6249412
End bp: 6251094
Gene Length: 1683 bp
Protein Length: 560 aa
Translation table: 11
GC content: 71%
IMG OID: 641244059
Product: cell envelope-related transcriptional attenuator
Protein accession: YP_001509469
Protein GI: 158316961
COG category: [K] Transcription
COG ID: [COG1316] Transcriptional regulator
TIGRFAM ID: [TIGR00350] cell envelope-related function transcriptional attenuator common domain
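
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is an illustration only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon. The function names are hypothetical and are not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_length = span_length(6249412, 6251094)
print(gene_length)                           # 1683
print(expected_protein_length(gene_length))  # 560
```

Under those assumptions, both values agree with the Gene Length (1683 bp) and Protein Length (560 aa) fields of this record.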


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0714783
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGACG CGCGCGCACG GCAGTCGTGC GGCGCCCCCG AAGTGACGCC GGCAGAGACC 
GCGGACGGTG AGCACGACAC CCGCGAACGC GACATCGGCG CCCGCGGCGC GGAACTCGAC
GGGCTAAAGA TCGGCGGCCC GGACGTCAGC GGCGACGACA TCGGCGGCGA GACCGAAGGC
GGCAGCAAGG CCGGCCCGCC GGGTAACGGG AGCGGCGCGG CGGCCGGCGC CGCGCGGCGC
GGCCCGATAC GCCGGGTTCT GCTCGTCCTG ACCGCCCTGC TGTCGGTCGC CGTGGTGGTG
GTGACCACCA CCGGCTGGTT CGTCATCACC TTCTACGACC GCAGGATCGA TCGCGAGACC
ATCGCGCCGC CTGCCGACAT CACGGTGACC CGCCCACCGC CCGCGCCGGT CGGCACCGAG
ACCTGGCTCC TCGTCGGCTC CGACGTGCGC ACCGGCTCGG ACGCCGCGGC GGTCAGTGGC
GCGCGATCCG ACACTATGAT GATCGCCCAC CTGGCCTCGG ACGGGCGGAC GAACATCGTG
TCGGTCCCCC GTGACCTGAG GGTGCCCATC CCGGCCTGGA CCGACGACGA CGGCACCCAC
CACCGGGCCC GCCGAGACAA GATCAACGCA GCGTTCGGCA GCGGTGGCCC CGCGCTCCTC
GTCGCCACCC TCGAGCAGGT GGCGGGACTG CGCATCAACC ACTACGCCGA ACTCGACTTC
AACGGCTTCC AGCAGATGAC GTCCGCGATC GGTGGCATCG ACGTGTGTCT GCAGGCATCG
AGCTACGTCG AGCCGCACAC ACTGGAGAAC GGCCGGCGGG TGCGGTCGAT GAACCTGAAC
GACCCCAGCT CCGGTTTCCT CGGGCAGCCG GGAAACAATC ATCTGATCGG CGGCAATGCG
CTTGCCTTTG TTCGGCAACG ACATGGTTTC GCTGACGGCG ACCTCTCTCG GATCCGCCGC
CAGCAGGCGT TTCTCGCGGC GATGTTCCGA AAGGTCAGCA GCAGCGACGT CCTCTTGCGC
CCGACCAAGC TCGCCGCGTT TCTGGGCGCG GTGACGCGGT CGGTGGTGCT GGACGACGAG
ACCGGCTTCA CCGAGCTGCG CGCGCTGGCC GAGCGGATGC GCGGGATGAC GACCGGCGCC
GTCACGTTCT CCACCGTCCC GATCACGGGC CAGATCGCCG AACCGGCCTT CTACTTCCTG
TATGACCCCG ACCAGATGCG GCAGTTCTTC CGGAACATCA CCGGCGGCGA GTCCCTGCCC
GAGCCCACCG GCTCCGGAGA CCTCATCCCG CTCGGCGGCG CCTTCACCCC GGAACCCTCG
ATCGGCGCAC CGACCGCCCC CACCGCGGCA GTCGCCCTCC CACCCACCGA GAGCGCCACC
CCAACAGTGA CGCCTCAGGT ACCCGACGTG GCATCGACGC CCGCCTCCCC GGCCGAACAG
CCAACAGTCA CCCCCACGCC CCCGCCTACG GCGATACCCA CGGCCACGGC GCCGACCGGT
GTCAGTGTCG GCGCCGGCGT CGACATCCTG GCCCGGCCGG TCGCCGGGAC CGGGCAGGAC
GCCACGGCCG GCACCACCCC GGGCACACTT GGGCCGTCCG CCACGCTGCC GGTGGGGCCG
TCGGTGGGAT CGTCCGCGAC GACCGAGCCG CCGGTGACGG CCGCCGCCGC CTGCATCTAC
TGA
 
Protein sequence
MRDARARQSC GAPEVTPAET ADGEHDTRER DIGARGAELD GLKIGGPDVS GDDIGGETEG 
GSKAGPPGNG SGAAAGAARR GPIRRVLLVL TALLSVAVVV VTTTGWFVIT FYDRRIDRET
IAPPADITVT RPPPAPVGTE TWLLVGSDVR TGSDAAAVSG ARSDTMMIAH LASDGRTNIV
SVPRDLRVPI PAWTDDDGTH HRARRDKINA AFGSGGPALL VATLEQVAGL RINHYAELDF
NGFQQMTSAI GGIDVCLQAS SYVEPHTLEN GRRVRSMNLN DPSSGFLGQP GNNHLIGGNA
LAFVRQRHGF ADGDLSRIRR QQAFLAAMFR KVSSSDVLLR PTKLAAFLGA VTRSVVLDDE
TGFTELRALA ERMRGMTTGA VTFSTVPITG QIAEPAFYFL YDPDQMRQFF RNITGGESLP
EPTGSGDLIP LGGAFTPEPS IGAPTAPTAA VALPPTESAT PTVTPQVPDV ASTPASPAEQ
PTVTPTPPPT AIPTATAPTG VSVGAGVDIL ARPVAGTGQD ATAGTTPGTL GPSATLPVGP
SVGSSATTEP PVTAAAACIY