Gene Francci3_2288 details

Gene Information

Locus tag: Francci3_2288
Symbol:
ID: 3904822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2668112
End bp: 2669722
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 73%
IMG OID: 637879619
Product: Fis family transcriptional regulator
Protein accession: YP_481385
Protein GI: 86740985
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00288923
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000159352
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCCAAA CTGAGACACG CGCGGCCGTG CCAGGGACGA GGCTGGCGCC TACGCTTATG 
ACCTGGATCA CAGCGGGGCT TTCAGGGAGT GTCATGCCCG ACGACGTCAT CCCGGCACGT
GCCCGCGGGT TCGCGGCCGA GGCGCGCGCG GGGCGCGACC ACACCCGGTC GGACGTCTCC
TGCCGGCTGA TGGCCTCCTG GCAGCGCAGC GAGGAGTACG GCGTCTCCCT CGACGACGTC
GACCCGGTCT TCTCCGGCAC GATCGACCAG AGCTCGCTGT TCTACGACAG CGGCCGCGAG
GTGCTGGCCA GCCTCCACCG GACGCTGGCC GCGGAGCCCG TCAGCCTGAT GTTGACCGAC
GCGGACGGCT TGGTGCTCAA CCGTCTCAGC GGGGACACCA GCCTCCTGCG CGCGCTGGAC
CGGGTCCACC TCGCGCCGGG CTTCTCCTAC GCCGAGCGGG TGGTGGGTAC GACGGGACTC
GGGCTCGCGC TGGCGGACCG CGCCCCGTCC CTGGTGCGGG CCGAGGAGCA CTACGCCGTC
GGGCTGTGCT GCTACACCTG CGCCGCGGCG CCGGTGCTCC ACCCAGGGAC CGGACGCCTC
GAGGGCTCGG TCAACCTGAC GACCTGGTCG GAGTCGTCGA GCAACCTGCT GCTCGCGCTG
GCTGAGTCCG CCGCGCAGCA CACGACGGAC CTGATGCGGT TGCGGTCCGG GGGCGTCACG
GGTGGCCGCC CGCGCCCACG CGGCGAGGTG TTCCGGGTGG AAAGCCCGCG CGCGGAGCCC
GGGGCCGGCA GTCTGCACGA CCTCTCCGGC TCCTGGCGCC GTGCGGTGTC ACTGGCAGAG
GCGGGCCTGC GCGACGGGCG GGTCGTGGCC TGTGTCGGCG AGCCGGGCAG TGGCCGTACG
ACGGCGCTCG CGCAGGCCCT GCGCCGGGCC TTTCCGCGCT ACCGCATCCT GGCCGCGAGC
AACCCGGCAG CGGCGGACGT CGAGCCGTGG CTGTCCCTGT GGACCCCGGA GCTGACGAAG
GCGAGCACGG CGGTCATCGT GCGCGACGTC GACCTGCTCC CCCTGTGGGT GGCCGAACAG
GTGCGGGACC GGGTGCTCAG GGCCCGGGTC GAGGCGCGAT CGGGCGCAGC CGACCCGGCG
GGCTGCCTAC CCTTCGTGAT CACGGTAGAG CGGTTCGAGG ATATCCCGGC CGCACTGCGC
GCGATCGTTG ACGGGATCGT CCCGGTCGCA CCGCTGCGCC AACGGCCCGA GGACATCGGG
CCGTTGGCGC GGGTGGCGGC CCTGCGGGCA CGGGGCCGCG AGGTGGATCT GACCCCGGCG
GCCGAACGCG CCCTGTCCGA CCATCGTTGG CCGGGCAACG TCGAGCAGCT GATGCAGGTC
GTCAAGAAAC TGGCTCGCCG CCACGACCCG ATCGACGTCG GACACCTCCC GGCCGAGGTG
CTCTCGCACG GCCGCCACCG GCTGACGCGA CTGGAGACGT TCGAGCGGGA CGAGATCGTG
CGGGCTCTGA ACGACCCGTC CCTCACCATG GCCGAGGCAG CCGAGCGGGT CGGCCTGAGC
CGGTCCACGC TCTACCGGCG GATCGCCCAG TACGGCATCC GGGTCAGGTA G
 
Protein sequence
MSQTETRAAV PGTRLAPTLM TWITAGLSGS VMPDDVIPAR ARGFAAEARA GRDHTRSDVS 
CRLMASWQRS EEYGVSLDDV DPVFSGTIDQ SSLFYDSGRE VLASLHRTLA AEPVSLMLTD
ADGLVLNRLS GDTSLLRALD RVHLAPGFSY AERVVGTTGL GLALADRAPS LVRAEEHYAV
GLCCYTCAAA PVLHPGTGRL EGSVNLTTWS ESSSNLLLAL AESAAQHTTD LMRLRSGGVT
GGRPRPRGEV FRVESPRAEP GAGSLHDLSG SWRRAVSLAE AGLRDGRVVA CVGEPGSGRT
TALAQALRRA FPRYRILAAS NPAAADVEPW LSLWTPELTK ASTAVIVRDV DLLPLWVAEQ
VRDRVLRARV EARSGAADPA GCLPFVITVE RFEDIPAALR AIVDGIVPVA PLRQRPEDIG
PLARVAALRA RGREVDLTPA AERALSDHRW PGNVEQLMQV VKKLARRHDP IDVGHLPAEV
LSHGRHRLTR LETFERDEIV RALNDPSLTM AEAAERVGLS RSTLYRRIAQ YGIRVR
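
The length fields in this record can be cross-checked against the coordinates and the sequence blocks. The following is a minimal sketch, not part of the original record. It assumes that the Start bp and End bp coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both assumptions are consistent with the values listed above. The function names are hypothetical helpers introduced for illustration.

```python
# Coordinates copied from the Gene Information section.
# Assumption: 1-based, inclusive coordinates.
START_BP = 2668112
END_BP = 2669722


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq_block.split()).upper()


def gc_percent(seq_block: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq_block)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


length_bp = gene_length(START_BP, END_BP)   # 1611
length_aa = protein_length(length_bp)       # 536
```

To check the GC content field, pass the full text of the Gene sequence block to `gc_percent`; `clean` strips the grouping whitespace first, so the block can be pasted as displayed.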