Gene Franean1_6286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6286 
Symbol 
ID5674605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7634331 
End bp7635749 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content69% 
IMG OID641245138 
Productputative HTH-type transcriptional regulator 
Protein accessionYP_001510534 
Protein GI158318026 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCGCC GTGCACAACC CATGACACAG TGCGTGACAT GGCTGTGGAA GGGTGAGAAC 
TTCTTCTCAG GGGTGAGACA CCTATGTCAC TGCGGCCAGA CGAGGGGGAC CGCCGTGCAT
GATGCCGACA CGCTGAAACA GCAGCTTGCG GCACGTTTCA GGCAGCTCCA GGGCGAGCAC
GGCCTCTCCG CGGTGCAACT CGAACAACGC ACGACGTACG ACCGGAAGTA CGTCGGGTGG
CTCCGGAATC GTGGTCGTCT TCCAGCACGT CATGTCCTGG TCGCGCTCGA CGAGGTGTTC
GGGACCGGTC AGGAACTCGC CGACCTGGGC GACGAGATCC GCGCAGCGCA GAACGACGAA
CGGTTGCGGC ACAAGTCAGG CAAGCTGCGG CAGGAACCGG TGCTGGACCA TGAGGGGGTG
GACCCGACGA ACCGGCGGGA ACTGCTTCAG ACTGGTGCGA TCTCGGCACT CGCCGGGGTC
GCGGCCGAGC GCAGCGTTGA GGTCGCCAGT GCCGACCTGG CGCCCCCGAA GCTGATCGAA
ATCGAAGAGG ATATTGATCG GTTCGCGGCC GAATACACCC TGCACCCCCA CGAGATCTTG
GCCCCGCAGG TGGTCCAGCG CTGGCGGCAG GTAGACGCAG CTCTCGGCCG CCGCGGGTCC
TGGGCGGCCC GGAGGCGGCT CACCGCCGCC GCTGGACGTC TCACCTACTA CCTGTCCCGG
CTGGCATTCA ACACCGGAGA TTTCGGGTCC GCGGTGCGTC TCGCAGCCCT CGCCGATCAG
TACGCCGCGC AGGTCGGTGA CCAGGTCGTG CAGGCGTCGG TCGCCGGGAT GACCTCCGGC
GTGGCGTTCT ACCGGCACCA GTACGACGAC GCCACGGCAG CTTTCACCGC AGCGGATCCG
CCGTACCTGC GCGCCCGGAA CGCCGCCTAC CGGGCGAGGG CCTACGCCGC GACCGGCAAT
GCGGAGCTCG CACAGGCCGA GCTTGACACC ATGTGGTCAT CACAGCTCGC AGGCACCCCC
CAACCGGGAG ACCTGCCACT CAGCATCGCC GGCGCGGAGA TGTTCACCGC CGTGGCGCTC
GTCCGTCTGG GCGACGGCAA ACGCGCCGAG CCCCACGCCC GGGAGTCCGT TGCAGGGCAC
GAGGCATCCG GGCCCGCGGC GCACCCCGAA GAGTTCGGCC ACGCGCTGTG CATGCTCGCG
AATACGCTTC TTCTGCGCCC CCACCCGGAG CCGGAGGAAG CGGCAGCGCT GGGACGGCGA
GCGCTCACCG TCCTGAATGG CCATCCCACG CACACCGTGG CCGTCCGCGC ACGTCTGCTC
GGCGAGGATC TCCGCCCGTT CGCGGCGGTG CCCGCTGTCG CCGAGTTCCG CGAGCTCGCT
CATACCGCTG GCCGCCCGGC GCTTACCGGG GCACGGTAG
 
Protein sequence
MSRRAQPMTQ CVTWLWKGEN FFSGVRHLCH CGQTRGTAVH DADTLKQQLA ARFRQLQGEH 
GLSAVQLEQR TTYDRKYVGW LRNRGRLPAR HVLVALDEVF GTGQELADLG DEIRAAQNDE
RLRHKSGKLR QEPVLDHEGV DPTNRRELLQ TGAISALAGV AAERSVEVAS ADLAPPKLIE
IEEDIDRFAA EYTLHPHEIL APQVVQRWRQ VDAALGRRGS WAARRRLTAA AGRLTYYLSR
LAFNTGDFGS AVRLAALADQ YAAQVGDQVV QASVAGMTSG VAFYRHQYDD ATAAFTAADP
PYLRARNAAY RARAYAATGN AELAQAELDT MWSSQLAGTP QPGDLPLSIA GAEMFTAVAL
VRLGDGKRAE PHARESVAGH EASGPAAHPE EFGHALCMLA NTLLLRPHPE PEEAAALGRR
ALTVLNGHPT HTVAVRARLL GEDLRPFAAV PAVAEFRELA HTAGRPALTG AR