Gene Franean1_1647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1647 
Symbol 
ID5670049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1967102 
End bp1968202 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content76% 
IMG OID641240565 
Producthelix-turn-helix type 11 domain-containing protein 
Protein accessionYP_001505991 
Protein GI158313483 
COG category[K] Transcription 
COG ID[COG2378] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.242588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.121155 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTGCCA GCCGGCTCCT CTCCGTCCTC CTGCTGCTGC AGACCCGCGG CCGGCTGACG 
GCCCGCGAGA TAGCCGCCGA GCTGGAGGTG TCGGTCCGCA CGGTCTACCG GGACCTGGAC
GCGCTCGCCG AGGCCGGCGT CCCGGTGCTG GCGGAGCGTG GGGCCACCGG CGGTTACGAG
CTGCTCGCCG GCTACCGCAC CCGGCTGACC GGGCTGACCG CGGACGAGGC CGACTCCCTG
CTGTTCGCCG GGCTCCCGGA CGCCGCCGCC GAGCTCGGTT TCGGCGCGGT AGTCGCCGCC
GCCGAGCTCA AGCTGCTCGC CGCGCTCCCG GCCGAGGCCC GTGAGCGGGC ACTTCGGGTG
CGGGAGCTGT TCCACCTCGA CGCGCCCGGC TGGTTCCGTG CCGCCGAGCC GGTGCCGCTG
CTCGCCGAGG TCGCCGGTGC GGTGTGGGGG CGGCGGCGCA TCCGGATCAC CTACCTGCGC
TGGCGCGCAC CGCGCCGGGT CGTCCGCGAG CTGGAGCCGC TCGGTGTGGT CCTCAAGAGC
GGCACCTGGT ACGTCGTAGC CGCCGCCTGC CCCGGCGACG GCCGGGCCGA CATCGCGGAC
GCTCCGCCCG CGGAGACACC GGAGACACCG GAGACACCTG AGGCGGCGGC GGAGGCGTTC
GAGGCGTCCG TGCGGGTGTA CCGGGTCGCG AAGATTCTCG GTCTCGAGGC GATGCCGGAG
ACGTTCGAAC GGCCGGAGCG GTTCGACCTG GCGGCCTATT GGGAGCAGTG GACCGCCAGG
TACGAGGCCG GCGTCTACCG GGGGACCGCC ACGGTGCGCC TGTCGCCGGA GGGCCGGCGG
ATGGTTCCCT TCCGGCTCGC CCCGGCGGTG GCGCGGGCCG TCGAGCAGAC CGCGGGCGAT
CCCGACGCCG ACGGCTGGGT GCGCGCCGAG CTCCCGATCG AGTCGGTCCG GCACGCCCGG
GGCGACCTGC TCCTCCTCGG CCCTGACCTG GAGGTGCTCG ATCCCCCGGA GCTGCGGGCG
GCGATGGCGG ACGCGGCCGC GGGCCTGGCG GCGCTCTACA GCCCGCCCGC GAACCCGCCC
GCGAACCCGC CCGCGTGCTG A
 
Protein sequence
MRASRLLSVL LLLQTRGRLT AREIAAELEV SVRTVYRDLD ALAEAGVPVL AERGATGGYE 
LLAGYRTRLT GLTADEADSL LFAGLPDAAA ELGFGAVVAA AELKLLAALP AEARERALRV
RELFHLDAPG WFRAAEPVPL LAEVAGAVWG RRRIRITYLR WRAPRRVVRE LEPLGVVLKS
GTWYVVAAAC PGDGRADIAD APPAETPETP ETPEAAAEAF EASVRVYRVA KILGLEAMPE
TFERPERFDL AAYWEQWTAR YEAGVYRGTA TVRLSPEGRR MVPFRLAPAV ARAVEQTAGD
PDADGWVRAE LPIESVRHAR GDLLLLGPDL EVLDPPELRA AMADAAAGLA ALYSPPANPP
ANPPAC