Gene Franean1_3605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3605 
Symbol 
ID5671974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4266218 
End bp4267654 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content69% 
IMG OID641242491 
Producttranscriptional regulator, TrmB 
Protein accessionYP_001507911 
Protein GI158315403 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCAGA TGATCGCCTT CCTGCGCGCC GCTGGAGGCG ACACCACCGA GGTGGAGGTC 
AAGTCTGCCG CCGGCGGGTT ACCCGCCTCG TTGACCTCTA CGATGAGCGC GCTGGCGAAT
CAGCCCGGGG GCGGGACCAT CATCCTGGGG CTCGACGAGC GGGCCGGGTT CCGCCCCGTC
GAGCTCAGCG ACCCGCAGAT GCTCAAGCAG GGCCTGGCTG CCAGGGCCCG GGCATTCACA
CCGCCGGTCC GTCTCACGAT CGACGACGGG GAGGTCGACG GGGCCACGGT CGTGGTTGCG
CAGGTCCACG AGTGCGACCG CTCGACCAAG CCCTGTCGCG TCACCGCGAC CGGCAGGGCG
TACCTACGTG GCTACGACGG CGACTACGCC CTGTCCGACA TGGAGGAACA GGGGTTTCTG
GCCGCTCGCC AGCCACCGCT GTTCGACCGT TCACCTGTCG AGGACGCCAC CATCGACGAC
CTGGACACCG AACTCGTCGA TACTTTTCTA CTCGCCGTCC GCGAACGCGA CCCCGCCGGG
CTCGGCCGTT TTCCCGACGA CACGGAGCTC CTGCGCCGGG CTGGAGTCAC GATGGAAGGC
GGGCAGCCGA CCGTCGCGGG ACTGCTCGCT CTCGGGGTCC ATCCCCAACA ATGGTTCCCT
CGCTACGTCA TCCAAGCCGC CGCGCAGCCC TTGCCCACAG ACTCCGCCGC AACGCGGGCC
CGCAACCAGG TCACCATCAG CGGGCCGGTC CCGCGGATGC TTGACGCGGC GCTACTCTGG
GCCCGACGTA CCTTCGACAC CGCCATCGTC GCCGAGATGG ACGGTAGCGT TCGTGACCGT
CCGATCTACC CACTCGTCGC CTTCCGTGAG TTGGTCGCCA ACGCACTGAT CCACCGCGAC
CTCGACCACT GGTCCGCCGG GCTGGCCGTC GAAGTGCGGC TTCTGCGGGA CCGCCTGGTA
GTGGCCAATC CCGGAGGCCT GTACGGCATC ACCGTCGACC GACTCGGGCG CGACGCGGTG
ACCTCCGCCC GCAACGCCAG ACTGGTCGCG ATCTGCCAGC ACGTCCGCTC CCCGCAAACC
GAAGCTCGGG TCATCGAAGC CCTCGCCAGC GGAATTCCCA CCGTCACCGA GGCCCTCGCC
GACCATGGCC TGCCGCCAGC CCACTACGTG GACAGCGGCA TCCGGTTCAC CGTCGTCCTC
CACCAGTTCG CGACCGCCCC GCCCGCGGCG ACCGCCGAGC CCCCAATGGG CGCCACAGAG
CGTCGCGTCT ACCAGACCCT GACCCGTCCG GGAAGAACAG TCAGCGACCT CGCCGAGGAG
CTCGGGCTGT CCGCTCCGAA CATCCGCAAG GCACTGCGAA GCCTGCGCGG TCGCGGGCTG
ATCCTTCAAC TCGGCGGCAG AGGCAAAGCC ACCACCTACC AGCGGACGGA CTCATAG
 
Protein sequence
MSQMIAFLRA AGGDTTEVEV KSAAGGLPAS LTSTMSALAN QPGGGTIILG LDERAGFRPV 
ELSDPQMLKQ GLAARARAFT PPVRLTIDDG EVDGATVVVA QVHECDRSTK PCRVTATGRA
YLRGYDGDYA LSDMEEQGFL AARQPPLFDR SPVEDATIDD LDTELVDTFL LAVRERDPAG
LGRFPDDTEL LRRAGVTMEG GQPTVAGLLA LGVHPQQWFP RYVIQAAAQP LPTDSAATRA
RNQVTISGPV PRMLDAALLW ARRTFDTAIV AEMDGSVRDR PIYPLVAFRE LVANALIHRD
LDHWSAGLAV EVRLLRDRLV VANPGGLYGI TVDRLGRDAV TSARNARLVA ICQHVRSPQT
EARVIEALAS GIPTVTEALA DHGLPPAHYV DSGIRFTVVL HQFATAPPAA TAEPPMGATE
RRVYQTLTRP GRTVSDLAEE LGLSAPNIRK ALRSLRGRGL ILQLGGRGKA TTYQRTDS