Gene Franean1_4238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4238 
Symbol 
ID5672593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5045686 
End bp5046762 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content75% 
IMG OID641243111 
ProductArsR family transcriptional regulator 
Protein accessionYP_001508528 
Protein GI158316020 
COG category[K] Transcription 
COG ID[COG0640] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.499229 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGTCG TGATCGTGTT GGACGGCGCC GCCCCCGGCC GGTTCAGCGT CGCCGTCTCG 
CCGCTCGCGG AGCTGGCCGC CTGCCTGCAT GTGCTGACCG GGTCCGCGCA CCACACCGAG
CACGCGGCCT GGGCCGACCA GGTCACCCGC ACCGCCCCGC CCGCCTTCCG CGCCGGGCTG
GGCCGCTTCG CGCCGCTGTG GACGGCGCTG CGGTGGCGCG CCTTCTACCC GGGGCTCGAC
GGTCCATCCC CGGCCGCCGC GCCGCTGGCC GGCCTCGGGA TCGACCGGTT CGCCGAGCTC
ACCGCATACG CCTGCGCCAG CGGGTACCGC GGCTTCGATT TCAGCCAGGT CTGCCACGAC
CCGGGGCAGG CCGCCGTGCT GCGTCATGCC GCCGCCCGGC TGCCGGAGCC GCACCTCGGC
CTGGCCGAGG ACCTGCTGCG CGACCCGGAG GCGCTGCGCG CGGACATCCT CCGCTTTCTC
GACCTGTGCG GACGGGTGTT CTTCGGCGGG CTCTGGGCGC AGACCGCGCC CGTGCTCGAC
CGGGCCGCCC ACCTGGTGCG GCGCCGTCTC GCCGACGGTG GGCCCGCGCC GGCCCTGGTC
TCGCTCAGCC CATCGAGCGC GCGTCTCATC ACGCCGTCGG CCGGCCCCGC CCGGGTCGTC
TTCGACAAGG TGCACCACGC GGTGATCAAC CCGGCCCGGA CCCCACTGCT GCTAATCCCC
ACCCGCTACG GTGCCCCGCA CCTGCTGGTG AAGAACGAGC CAGGCCTGCC CCCCGTCGTC
CACTTCCCGG TCGAGGCGCC GGAGGTCGGC GTCACCCTGG CCCGCGCCCG TCTGCTGGCA
CTCACCGATC CGAGCCGGGT GCGGCTGTGC CGGCTGATCG CGCGGCAGGC CATGACCACC
GCAGACCTGG CGGACCGGCT GACGATGACC CGCCCCCAGG TCTCCCGCCA TCTGCGTGCC
CTGCGCGAGC TGGGGCTGGT GCGGATGGAG CGCCACGGGC GGCACGTCCT CTACGAGCTC
GACGTCGGCG CGGTCGGCCG CATCGGGCGA GATCTGGCGA CGGCCCTGCA GTACTGA
 
Protein sequence
MSVVIVLDGA APGRFSVAVS PLAELAACLH VLTGSAHHTE HAAWADQVTR TAPPAFRAGL 
GRFAPLWTAL RWRAFYPGLD GPSPAAAPLA GLGIDRFAEL TAYACASGYR GFDFSQVCHD
PGQAAVLRHA AARLPEPHLG LAEDLLRDPE ALRADILRFL DLCGRVFFGG LWAQTAPVLD
RAAHLVRRRL ADGGPAPALV SLSPSSARLI TPSAGPARVV FDKVHHAVIN PARTPLLLIP
TRYGAPHLLV KNEPGLPPVV HFPVEAPEVG VTLARARLLA LTDPSRVRLC RLIARQAMTT
ADLADRLTMT RPQVSRHLRA LRELGLVRME RHGRHVLYEL DVGAVGRIGR DLATALQY