Gene Francci3_4158 details


Gene Information

Locus tag: Francci3_4158
Symbol:
ID: 3907123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4959583
End bp: 4960794
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 60%
IMG OID: 637881486
Product: XRE family transcriptional regulator
Protein accession: YP_483235
Protein GI: 86742835
COG category: [K] Transcription
COG ID: [COG1396] Predicted transcriptional regulators
TIGRFAM ID:
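
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(4959583, 4960794)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```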


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0178123
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGAGA TCCGGGCTCT GGGCGACCGG GTCGCTCAAG TACGTGTACG CCGTTCGATG 
ACACAGACCG AGCTTGCCGA GCGTGCAGGC GTGTCTACCG ACCTGGTTAC GAAGCTGGAG
CAGGGCCAGC GTGACGGCAT ACGCATCTCT ACGTTGCACA GCCTTGCTAG GGCCTTGGAC
GTTCCTACCG CTACGTTCTT TGAGGTGGAG CAGGAGGAGA CGGTGAGCGA TAATGAGGCC
TTCATACCGC TTCGGAAACT GTTGCTCCCT GGACCTTCCA GCGGGCAAAC CGACGAACCG
GCCCTATCGC TCCAACCGTT GCGGCAACGT CTGGTGGCGC TGACCCAGGA CTACCATTAT
GCGCGGTATC CGCAGGCCGT TCGGACCGCT CCAGAACTGA TCGAGGACAT CACTGCCGCC
ACGGGCATAC ACCAGGAGGA GGACCAGAAG AGTATATATC GCCTGCTGGC GCACGCCTAC
ATTATGGCTG CCTCGATTCT CATCCAGCTT AGCGGGGAGG ACTTGGCTTG CGAGGCCATA
CGCCGGAGCA TGGAAGCGGC GGAGCAAGCC GGAGACCCGA TTCTCCGTGC AAGCGGAGTG
GTGTACTACC GGTGGGCATT CATCCGCCAA GGCCGATTCG ACGATGCCGA AAAGGTGGCC
GTCGACATGG CCACCGAGAT CGAGCCAAGC ATCATGTCAG CAACCCCCGA ACACCTTGCA
GTATGGGGGA GACTGCTGAC CGGCGCCTCT GCCGCCGCAG CTCGGAACAA CCGTCCGGAA
ACAGCGAAAG ACCTACTTTC GTATGCCCGT AGCTCAGCCG CGCGTGTAGC CGACGGAAAA
ATGGACTACG CTAAGTACTG GGCGGCGTTC GGACCGAGTC AAGTCGACGC AATCGAGGTC
GAAAATGCCA TGACGCAAGG CGATGCGCCC CGTGCATTGA CTCTGGCCCG GAGGGTCCGT
CGGACTGAGA ACATGCCACT GAGCAACTGG ACACGTCACC TTCTCGCTGT GGCCGAGGCA
CAAACGGCCA CAAGAGACTA CGCGAGTGCC ATTCAGACAG TTCAAGATGT CTACACTCTC
ACCCCGGAAT GGCTACGGGA GCAGCGCCTA GCCAGCAGAC TAATACGCGA CCTACTAGAC
GCTACGAGCG TCCGAAGAGC CCGAAAGACC GGTCTAGCCG ACCTGGCTAC ATTTGTGGGC
ATCAAGCCGT AG
 
Protein sequence
MTEIRALGDR VAQVRVRRSM TQTELAERAG VSTDLVTKLE QGQRDGIRIS TLHSLARALD 
VPTATFFEVE QEETVSDNEA FIPLRKLLLP GPSSGQTDEP ALSLQPLRQR LVALTQDYHY
ARYPQAVRTA PELIEDITAA TGIHQEEDQK SIYRLLAHAY IMAASILIQL SGEDLACEAI
RRSMEAAEQA GDPILRASGV VYYRWAFIRQ GRFDDAEKVA VDMATEIEPS IMSATPEHLA
VWGRLLTGAS AAAARNNRPE TAKDLLSYAR SSAARVADGK MDYAKYWAAF GPSQVDAIEV
ENAMTQGDAP RALTLARRVR RTENMPLSNW TRHLLAVAEA QTATRDYASA IQTVQDVYTL
TPEWLREQRL ASRLIRDLLD ATSVRRARKT GLADLATFVG IKP
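
The gene sequence above starts with GTG while the protein sequence starts with M. That is consistent with translation table 11, in which GTG is one of several alternative initiator codons read as methionine. The sketch below illustrates this, together with a simple GC content calculation. It is a minimal illustration, not the method used to produce this record: the function names are hypothetical, the input is assumed to contain only A, C, G and T (plus whitespace), and the first codon is assumed to be the true start of the CDS.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 assigns the same
# amino acids as the standard code; it differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons of translation table 11.
START_CODONS_TABLE_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
first_bases = "GTGACGGAGA TCCGGGCTCT GGGCGACCGG"
print(translate(first_bases))  # MTEIRALGDR
```

Passing the full gene sequence to `translate` and `gc_content` would allow the protein sequence and GC content listed in the record to be checked in the same way.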