Gene Franean1_5332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5332 
Symbol 
ID5673666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6422023 
End bp6423201 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content69% 
IMG OID641244190 
ProductXRE family transcriptional regulator 
Protein accessionYP_001509596 
Protein GI158317088 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0406632 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCATA CCTGGCGGGT CTCACCCGGT TCGAAGCGGA CGATGCCGAA CAGCAGCGGC 
CCGGGATCAC CTCCACCTGC TCTGTCTGCT CTGTCGAACA TCACGTTCCG TCCTCCTGTG
TTGGATCGTG GGCGACGAGG CACCGGAACC GACCCGCGAA GGTGTTCGAC CTGCGGCGGA
TCTGGTGCCC GCCCACCGTC CAACCGGACG GCTTGTCACG GAAGGCCGCC GGCCGTATAC
GCCGAGGGTT CCGATCTATA CGGAGACCGG ATAGGCTCTG CGCGTCCGAT CAAGGACAGA
CGGTCCTGTA CCTACCGTCC GGAGGCGCCC GTGGTTTCCG TCCGCCGCCC GTTACCCCAG
GCCCCGCCGG GACTCTGGGA CCGGCCAGAG ATGGCCGACG CACTCGCCCG CCGTGACATC
GGCACCGTCT TCAAGATCTA CCGCCAGTGG ACCGGCGCGA CCCAGACACA GATCGCCGCT
GTCTGCGGCC TCCCGCAGTC CCACGTCAGC GAGATCTCAA CCGGCCGCCG CCAGGTCACC
AGCCTGGAGA TCTTCGAGCG CATCGCCGAC GGCATCGACA TCCCCCGGGG CCGCATCGGA
CTTGCCGAAA GACCCGGCGC CGTCCCTGAG CCACGGACCG AGCCCGGAGC GCCCGTCGTC
AACGTGCCCG GCGACATCGT GCACGTCTAT CCCAGCCGGA CAGCCGTCCC GTCCGAGCTG
TGGCGGACGC TGTTCGCCGG CGCGCGCCAC CAGGTCGACG TCCTCGTGAT CGCCGGCCTG
TTCCTCCCCG ACGGCCACGC CGACTTCACC ACCGTCCTCC GCCACAAAGG CGCGGAAGGC
GTCACGATCC GCTACGCACT CGGCGACCCC GAATCACCCG CCGTCGCTCT CCGCGGCGAA
GAGGAAGGGA TCGGCGACGG GCTCGCCGCC CGGACCCGGA TCACGCTCAC CTACCTCGCG
TCCCTCCGCG AGGCGCCCGG GATCGAGCTA CGGCTCCACG CCACCACGCT CTACAACTCC
ATCTACCGGT TCGACGGCGA CATGCTCGTC AACACCCACG TGTACGGCGC CCCCGCCGCG
CACTCCCCCG TCATGCACCT ACGCGCCCAG TCCGGTGGCC TGTTCGACCA CTACGCCGCC
AGCTTCGAGC GCATCTGGGC CACCACCGAA GGAGCCTGA
 
Protein sequence
MSHTWRVSPG SKRTMPNSSG PGSPPPALSA LSNITFRPPV LDRGRRGTGT DPRRCSTCGG 
SGARPPSNRT ACHGRPPAVY AEGSDLYGDR IGSARPIKDR RSCTYRPEAP VVSVRRPLPQ
APPGLWDRPE MADALARRDI GTVFKIYRQW TGATQTQIAA VCGLPQSHVS EISTGRRQVT
SLEIFERIAD GIDIPRGRIG LAERPGAVPE PRTEPGAPVV NVPGDIVHVY PSRTAVPSEL
WRTLFAGARH QVDVLVIAGL FLPDGHADFT TVLRHKGAEG VTIRYALGDP ESPAVALRGE
EEGIGDGLAA RTRITLTYLA SLREAPGIEL RLHATTLYNS IYRFDGDMLV NTHVYGAPAA
HSPVMHLRAQ SGGLFDHYAA SFERIWATTE GA