Gene Franean1_4931 details

Gene Information

Locus tag: Franean1_4931
Symbol:
ID: 5675745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5920068
End bp: 5921561
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 71%
IMG OID: 641243785
Product: TetR family transcriptional regulator
Protein accession: YP_001509201
Protein GI: 158316693
COG category: [K] Transcription
COG ID: [COG1309] Transcriptional regulator
TIGRFAM ID:
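
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one untranslated stop codon. The record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5920068
end_bp = 5921561
gene_length_bp = 1494
protein_length_aa = 497

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the listed gene length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions all three checks print True: 1494 bp is 498 codons, and 498 codons minus the stop codon gives the listed 497 aa.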


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.475051
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.080695
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGAGA ACGAGATCCT GCGACGGGCG AGCTACGGCC CCACCAGTCC GGTGGTGGGG 
GCACGCGGTT CACGCACCCG CACGCGCATC GTCGACACGG CGCTGGCGCT ATTCGAATCC
CAGGGCTTCC ACGGCACGTC GGTGGACGAC ATCGCGAAGG CCGCCGAGGT GTCCCGGGCC
ACGTTGTACC AGTATTTCGA GAGCAAGGAA CAGATCTTCG TCGAGCTGCT CGAGGAATGC
GGCGGCGCGC TCATGCGGGT GGTTCGCCGC ATCGGCCCGC TGGAACCCAG CGAGCTCGGC
TTCGACAATC TGCACTGGTG GCTCGGCGAG TGGGCCTGGG TGTACGACAG GTACGCGACG
ATGTTCGTGC AGTGGGCGAA CATCAGTTCG CCCGGCACCG CGATACCGCC GCTGGTCAGC
CGCTTCTCGG CCGCCTACCG GGAGCGGATC GCGCAGCGGC TGACCTCGTC GGGGGTGACC
GGCCTCGCCG CCGCCGACGC GGCGCTGACA CTGACCGCGC TGGTGAGCAG CTGTAACTAC
ACGCGCCATG CCGATGCGCC GCTCGGCCGT GAACACCCCG GCACCGGGCC GGCCGCCCGC
GGCACGGTGA TGGGCACCGA CTACCTGACC GACAACCTCG CCGTGCTGGT CCAGCTCATC
CTTTTCCCGC ACACCCCCGT GAGTGTTTTC GCCCAGCTGG GCCGGGAGAT CCCCACCCCG
CGCCCGCCGA GCACCGGAGA ACGCCTCTGG TCCGTACCGG CACCGGCGGC CGGGCCGCCG
CCGGCTTTGC CGCCGGATGG CCGTCTGCGG CAACTCAGCA GCCGCGCCAC GGTGACCGTT
CGGCGACTTC TCGACGCCGG GATCAGATGT TTCACCGAAA AGGGCTACCA CCAATGCTCG
GTCGACGACA TCGTCACCGA GGCGGGTTAC GCGCGCGGCA CGTTCTACAA GTATTTCGAC
GAGAAACTCG ACCTGCTGGT GGCGCTGAGC GACGAGGCGA TCGAGACGAT CACCGAGCTC
GACGGCCGGC TGCGGCGGAT CGGCCCGACT CTCGGGAGCG ACCCCGCCCA GCTACGGAGC
TGGCTCGGTG ACGCCGTCGC GTTCCACCTG CGGTACCTGG GCGTCACACG GGCCTGGCTC
GACCGGCGGC CGTGCCACCC ACGCCTCGAC GCGGCCCGCC GGCTCGTCGG CGAACGGCTG
CACACCGGCT ACACCGCCCT GCTGGGCCCG GCGCGCTGGT CGCATCCGCT TGATCCGCGG
GTCGCCAGCA TCGCGTTCTT CACCCTGCTG GAGCGACTGC CCGAGGCGAT GGTGGCGGCC
GAGCCGGACC GCCCGCTCAC GGAGATCGTC GACCTCGTCG CCACGGTGCT CGAACGCGCG
CACCTGTGCC CCCAGAGCGG CCCCCACGAC TCCCAGCCCG CCGGCGCCGA CCAAGACAGC
GCCCGGGCCG CCGCCGCGTC CGTCCCCTCG GGACGCCACC TCCCCCGGCC CTGA
 
Protein sequence
MSENEILRRA SYGPTSPVVG ARGSRTRTRI VDTALALFES QGFHGTSVDD IAKAAEVSRA 
TLYQYFESKE QIFVELLEEC GGALMRVVRR IGPLEPSELG FDNLHWWLGE WAWVYDRYAT
MFVQWANISS PGTAIPPLVS RFSAAYRERI AQRLTSSGVT GLAAADAALT LTALVSSCNY
TRHADAPLGR EHPGTGPAAR GTVMGTDYLT DNLAVLVQLI LFPHTPVSVF AQLGREIPTP
RPPSTGERLW SVPAPAAGPP PALPPDGRLR QLSSRATVTV RRLLDAGIRC FTEKGYHQCS
VDDIVTEAGY ARGTFYKYFD EKLDLLVALS DEAIETITEL DGRLRRIGPT LGSDPAQLRS
WLGDAVAFHL RYLGVTRAWL DRRPCHPRLD AARRLVGERL HTGYTALLGP ARWSHPLDPR
VASIAFFTLL ERLPEAMVAA EPDRPLTEIV DLVATVLERA HLCPQSGPHD SQPAGADQDS
ARAAAASVPS GRHLPRP
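
The relationship between the gene sequence and the protein sequence can be sketched in code. The example below is an illustration rather than the database's own pipeline. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons, and it does not handle alternative start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 uses these same assignments for non-initiator codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks that group residues into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 21 bases of the gene sequence above, spacing kept as listed.
prefix = "ATGTCCGAGA ACGAGATCCT G"
print(translate(prefix))  # MSENEIL, the first seven residues of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the listed GC content.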