Gene Franean1_1870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1870 
Symbol 
ID5670272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2248178 
End bp2249191 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content71% 
IMG OID641240792 
Producthelix-turn-helix type 11 domain-containing protein 
Protein accessionYP_001506214 
Protein GI158313706 
COG category[K] Transcription 
COG ID[COG2378] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.199317 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGAC CGACCGCTCA CGTGCTCACC CTGCTGGAGC TCCTGCAGTC GGGTGGCACC 
AGGACGGTGG CCGAGCTCGC CGATCGGCTC GGCGTCGACG GGCGCACCGT GCGGCGGTAC
GTGCAGCACC TGATCGACCT CGACGTGCCC GTCGAGTCGG TGCGCGGCCG CTACGGCGGG
TACCGGCTCG CCGCCGGCTA CCGCCTGCCT CCGCTCATGC TGAACGACGA CGAGGCGCTC
GCCGTGCTAC TGGGCCTGAT CGCCAGCCGC CGAGCGGGCC TGCTGACGAC GACCGGCACC
GCAAGCGAGA CGGCGGCGGC CAAGATCCGG CGGGTACTGC CCGAACGGCT CGCGCGCGGG
CTGGACGCCG TGCTCGACTC CCTCGCCTTC ACGGCCCCGC CTGGCGAGGC AACCGCCGCG
GAAACCGCCG TCGTGCTCCC CATCGCCGAC GCGGTACGCC ATCATCGGCC GATCTCGATC
AGGTATACCG CCGCCGACGG ACGGCGCAGC GAACGCACCC TGCATCCGTA CGGGCTCGTC
GCCCACAACG GCCGGTGGTA TGTCACGGGT GCGGATCCCG GGATCGGCGA GGACCGGACC
TTCCGGCTGG ATCGCATCGA GCACGCACGG ACCCTGCCGG GCTCATTCGA GCCGCCCGAC
GGACTCGATC CGGCGCAGCG CGTCCTATCG GGACTCGCCA ATGCTCCGTA CCGGCATGAG
GTGATTCTGC GGATCCAGGG AACTGTCGAA CAGATTCGCG CCCGGCTTCC CGCCAGCGTC
GCGATCGTGG AAGAGCCTCC GTCCACGGGA GACACCGATC CCGGAACCGA GCGCTGGCTG
CGGGTCGAGC TCCGAGCGGA ACGGCTCGAC TGGCTTCCCC CGGTGCTCGC GTCACTCGAC
CGGCCCTTCG TCATCGAGCG ACCAGATGAC CTCCGCGGCC TCGTTCTGGC GCTCGCCGAC
CGCCTCGCGA CCTCCGCCCA CACAGACCGC CTCGCACATG ACCGGAACCC ATGA
 
Protein sequence
MARPTAHVLT LLELLQSGGT RTVAELADRL GVDGRTVRRY VQHLIDLDVP VESVRGRYGG 
YRLAAGYRLP PLMLNDDEAL AVLLGLIASR RAGLLTTTGT ASETAAAKIR RVLPERLARG
LDAVLDSLAF TAPPGEATAA ETAVVLPIAD AVRHHRPISI RYTAADGRRS ERTLHPYGLV
AHNGRWYVTG ADPGIGEDRT FRLDRIEHAR TLPGSFEPPD GLDPAQRVLS GLANAPYRHE
VILRIQGTVE QIRARLPASV AIVEEPPSTG DTDPGTERWL RVELRAERLD WLPPVLASLD
RPFVIERPDD LRGLVLALAD RLATSAHTDR LAHDRNP