Gene Francci3_0858 details

Gene Information

Locus tag: Francci3_0858
Symbol:
ID: 3904340
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1000133
End bp: 1001626
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 70%
IMG OID: 637878191
Product: XRE family transcriptional regulator
Protein accession: YP_479971
Protein GI: 86739571
COG category: [K] Transcription
COG ID: [COG1396] Predicted transcriptional regulators
TIGRFAM ID:
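
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not contribute a residue; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1000133
end_bp = 1001626

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1494 bp) and Protein Length (497 aa) fields.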


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.359757
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGGTC ACACTTGGCT ATGGACGCCG CCGCGTTCTA CTCCCACCAG CGTTGGTGTG 
GGCCAACTCC TACGCGCGTA CCGGCAGGCC CACGGCCTCA CGCAGCAGCA GCTCGCCGAC
CGTCTGGGCT TCGACCAGTC CTACGTATCC AAGGTGGAGA GCGGCCGGCG GGCTATCCAC
GACATCTCGA CGCTCCGTCA CATCGCCCGC AATCTGGCGC TGTCCCCCGA GGACGTCGGC
CTGGCCCCGG GATCGATGAC GGACCGTCGC CGGGAGACAG CGCCGCGCAG TCCGGCGGCC
GAGCAGGCCG CCGCGAGCCA ACGCGCGTGG CGGCTCACCC GCGATCACCT CAACCACCAT
CGGATCAGCC TCGCCCGCGC GGCGGTGCGG CTCTACCCGG AAACCGACCG ACTCGGCAGC
GGGCTACTGG CCCGACCCGG CTGGGTGTGG GACACCCCGG TGGACATCGA CGATGTCACC
CTTGGGTGGC ACGATAGCGC CGAGGCCCCC ACGATCAGGG GAGACGAACC CGAGACGGCC
GGTAGCCGGC CCCTGCTCGG TGGAGGCGGT CAGCAGAACC GCTACCAGCG CTACACGCGG
GCGATGCGGG ACCTCGACCG GCCGACCCTG TTCGAGAACC GGCTCAGTTT TCGCCTGCTC
GATATCGGCG GTGACGGGAC CGCGGGACCG TCAACGGACA ACCATCCGTC GCTGAGCTTC
GCCCACACGA CGTACTTCGA CGCCGTCGAC GTCTGCGAGA CCGTCGCACA CGAGACGGCG
GCCGCGATGG CCGGCAGCGG ACTGTCCTGG CCCGCGCTGC CCTTCCGGGC GCGCATCGGA
GATCCGTTCG ACTTCACCCG GCGGGCGGCC CTGCCGTCCA TCAACACGCT CACGATCCGC
CACGATCGCG GCGGCCAGGC GAGCTTCTTC CTGCACCGGC GCAGCGCGGG CTCGGTCGCG
ACCGCGGGCG GGGTGTACCA CGTCATCCCG GCCGGGGTCT TCCAGCCGTC GGGCATCACC
CCGTTCCACC ATGAGGCCGA CTTCGACCTG TGGCGCAACA TCATGCGTGA GCTGAGCGAG
GAGCTGCTGG GCAACCCGGA GCACGACGGC AGCTCGTCCA GCCCGATCGA CTATGACACC
GACGAGCCGT TCCGTTCGTT CGAGCAGGCG CGTCGGTCCG GCGCCATGCG GGTGTCGTAC
TTCGGGATCG GGCTGGACGC GCTCACTCTG TTCGGCGAGA TCCTCACCGT CGTCGTGGTG
GACGCCGACG CCTTCGACCG GCTTTTCGGC AGCATGGTCC AGACCAATGC CGAGGGATCG
GTCGTCACCG CGGGTCCGAA CCGGTCGGTG ACCGAGGGCA TCCCGTTCAC CCACTCGGCG
CTGCGCCGTC TCGTCGACAC CGAGCCCATC GCCCCGTCGG CCGCCGCCTG CCTGGAACTC
GCCTGGCGCC ACCGGGAGAC CCTCCTCGGG ATGCGCATGC GGGCCGCATC CTGA
 
Protein sequence
MTGHTWLWTP PRSTPTSVGV GQLLRAYRQA HGLTQQQLAD RLGFDQSYVS KVESGRRAIH 
DISTLRHIAR NLALSPEDVG LAPGSMTDRR RETAPRSPAA EQAAASQRAW RLTRDHLNHH
RISLARAAVR LYPETDRLGS GLLARPGWVW DTPVDIDDVT LGWHDSAEAP TIRGDEPETA
GSRPLLGGGG QQNRYQRYTR AMRDLDRPTL FENRLSFRLL DIGGDGTAGP STDNHPSLSF
AHTTYFDAVD VCETVAHETA AAMAGSGLSW PALPFRARIG DPFDFTRRAA LPSINTLTIR
HDRGGQASFF LHRRSAGSVA TAGGVYHVIP AGVFQPSGIT PFHHEADFDL WRNIMRELSE
ELLGNPEHDG SSSSPIDYDT DEPFRSFEQA RRSGAMRVSY FGIGLDALTL FGEILTVVVV
DADAFDRLFG SMVQTNAEGS VVTAGPNRSV TEGIPFTHSA LRRLVDTEPI APSAAACLEL
AWRHRETLLG MRMRAAS
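
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own method. It assumes the listed gene sequence is already the coding strand, uses the amino-acid assignments of translation table 11 (which match the standard code), and treats only ATG, GTG and TTG as initiators, a subset of the start codons that table allows. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the table 11 initiator codons, each read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop the spaces between 10-base blocks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # an alternative start codon still initiates with Met
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGACCGGTC ACACTTGGCT ATGGACGCCG CCGCGTTCTA CTCCCACCAG CGTTGGTGTG"
print(translate_cds(first_line))  # MTGHTWLWTPPRSTPTSVGV
```

The output matches the first 20 residues of the protein sequence, including the leading M that corresponds to the GTG start codon.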