Gene Franean1_6608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6608 
Symbol 
ID5674923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8042000 
End bp8042992 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content71% 
IMG OID641245459 
ProductAraC family transcriptional regulator 
Protein accessionYP_001510851 
Protein GI158318343 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTGC TGAGCGACGC GGTCACGATG TTGCGCACGG GTCGTCCTCA CTCGAACAGC 
AACCGCCTGT GGGCGCCTTG GGGCATCCGC TTCCCCCCGA CCGACGGCGC GGGATTCCAC
ATCGTGCTCA TGGGCACCTG CTGGCTGCTG CGGGCGGGCG CCAGGCCACT CCGGCTCGCC
GCCGGTGACA TCGTGTTACT GCCTCGAGAA CCAGGGCATG CCCTCGCCGA TGATCCGGCC
AGCCCGCTCA CCGACTTCCG GGCCGACCCC CACGGGCCGA TCCCCAGCGA GCATGCGGAC
GGATACGGCG GGTCCGGCCC GGGCGGCCGG ACCGTCACCG AGTTGCTCTG TGGCGCCTAC
ATGTTCGACC GCTTCCGTCT GCACCCGCTG TTGGCGGACC TACCCGACGT CATCCACCTG
CCCGCGCGGG TCGGGCATCA CCCGAGGCTG CGGGCCGCGG TGGATCTGCT CGGCGCCGAG
CTCGCCGAGC CTCGCGCGGG TGCGGCCGCG AGCATGTCCG CACTGCTCGA CCTCCTGCTG
CTCTACATGC TTCGCGCCTG GTTCGACGAT CATTCGACCG GCTCGTCGAC CGGCTCGTCG
ACCGGCTGGT CCGCGGCACT CGCGGATCCC GCGGTGAGCG CGGCGCTGCG GGCGATGCAC
GCCGAACCGG AAATGCCATG GACGGTGCGT GAGCTCGGCG CGCGGGTCGG ACTGTCCCGT
ACGGTCTTCG CGCAGCGGTT CACCGCACTC GTCGGCAAGC CGCCGTTGGC GTACCTGACC
TGGTGGCGGA TGACCATGGC AGCGAGGCTG CTGCGGGAGA CCGACTCACC GCTGCCTGCG
GTGGCCCGGC GCTGCGGCTA TTCGTCGGAG TTCGCCTTCG CCAAGACCTT CAAGCGCGAG
TTCGGCGTCC CGCCGGGCGC ATTCCGGCGA GAGGGACGGC CACCATCCGG TTCACCCGCG
GACGCCTCGC CACTGACGGC AACCTTGTCA TGA
 
Protein sequence
MDVLSDAVTM LRTGRPHSNS NRLWAPWGIR FPPTDGAGFH IVLMGTCWLL RAGARPLRLA 
AGDIVLLPRE PGHALADDPA SPLTDFRADP HGPIPSEHAD GYGGSGPGGR TVTELLCGAY
MFDRFRLHPL LADLPDVIHL PARVGHHPRL RAAVDLLGAE LAEPRAGAAA SMSALLDLLL
LYMLRAWFDD HSTGSSTGSS TGWSAALADP AVSAALRAMH AEPEMPWTVR ELGARVGLSR
TVFAQRFTAL VGKPPLAYLT WWRMTMAARL LRETDSPLPA VARRCGYSSE FAFAKTFKRE
FGVPPGAFRR EGRPPSGSPA DASPLTATLS