Gene Francci3_0992 details


Gene Information

Locus tag: Francci3_0992
Symbol:
ID: 3905848
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1178211
End bp: 1180655
Gene Length: 2445 bp
Protein Length: 814 aa
Translation table: 11
GC content: 70%
IMG OID: 637878325
Product: LuxR family transcriptional regulator
Protein accession: YP_480104
Protein GI: 86739704
COG category: [K] Transcription
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG2197] Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain
        [COG3903] Predicted ATPase
TIGRFAM ID:
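
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon (both are assumptions, though they agree with the listed values). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1178211, 1180655)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2445 814
```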


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTGTGTGT CCGGACGTGG AAGGTCGAAC CTGCCGGCGG AACTAACCAG CTTCGTCGGT 
CGCAGACAGG AGCTGGCAGA GGCATTGCGG TTGCTGGCCG GCGCCCGGTT GCTCACGCTG
ACCGGGCCCG GGGGGGTCGG GAAGACCCGC CTCGCCCTTC GGCTCGCGGC GGACCGGCAG
CGCGCGTTCC CCGACGGTGT CTGGATCGCC GAGTTGGCCG ACCTCGACGA CCCGTCACTG
GTCGTCGACG CGGTCGCCGA GGCGATGGAC CTGCAGTTCC TCTCCGGGCG ATGGACCGCG
GCGACGCTCG CCGCGCACCT TGCTCCGCGC AGGATCCTGC TCGTGCTCGA CAACTGCGAG
CACCTCGTGG ACACGTGTGC CCGGCTCGTC AACACCCTGC TGCGCGCGGG CCCGGACCTG
CGGGTCATCG CAACCAGCCG CCACCTGCTC GGGGTGGCGG GGGAACGGGT GCTGAAGGTG
CCCCCGCTGA GAGTACCCCC GGTGCCGGCC CCGGGCGCGG ACCAGTCCGC GTTGGCTGGC
GGGCTCATCC AGTACGAGGC GGTGAGCCTG CTGGCGGAGC GCGCGGCGGC GGTACTGGGG
GAATTCAGCC TCGACGAGGG CAACTGGCGG ACCGTCGCGT CGCTGTGCCA TCGCCTGGAT
GGCATCCCGC TGGCGATCGA ACTGGCGGCT GCTCGGCTCC GGACGCTGTC CCTGGAGCAG
ATCCTCGATC GGCTCGACAA CCGCTACGCC CTGTTGACGC GAGGCAACAG GGCCGGACCC
CGCCGCCAGC AGACCCTGCG CGCGCTGATC GACTGGAGCT TCGGCCTGTG CAATCCCGAG
GAGCAGCAGA TGTGGGCGTG CCTGTCGGTG TTCGCCGACG GGTTCACCTT GGAGGCGGCT
GAGCAGGTGG AGGGCGCCAG GTACCCGCCG GCAGTCGTGC TGGACGTGCT GAGCGGGCTG
GTTGACAAGT CGATTCTGGT CCGTGACGAC GACGAGGGGA TGGTCCGGTT CCGGATGCTG
GACACCATTC ACCAGTACGG CCAGGACCGC CTGCGCGAGT CCGGTCGGGA ACAGGACGTG
CGCCGTCGGC ATCTCGCCTA CTACCGCCGG ATGGCGGCCC TCGCCGACGC TCAGTGGTGC
AGCCCGGCCC AACTGGACTG GATCCGGCGG ATCCGGAACG AGCAAGCGAA CCTACGGGTT
GCGCTGGACC TCTGCCTCAC CACACCGGGC CTGGCGGAGA CCGCTCTGTC CATCATGGTG
GCCTTGTGGC AATACTGGGC CGCCATCGGG CTACTCAGCG AGGCGCGCTT CTGGTTTGAT
CGGGGGCTGA GCAAGGCCAC GGAGCCGGAC GGGATCCGGG CCATCGGTCT GCAATCGGCC
GCCCACGCCG CGTCCCTCCA GGGCGACATC ACCGCGGCGG CGTCGCTGCT GGCCGAGGCT
CGCGATCTGG CCGCCAGGCT GCACGAGCAG CGGGTGCTCG CACGCATCGC CTGCGTCGAG
GGCAGGCTCG CCGCCATCAG CGGGGACACG TCGGCGGCGG TGGAGCGCAG TCAGGACGCC
CGCCGCCGGT TCACTGACCT CGACGAGCCG CTCGGTCTCG CCCAATCACT GCTCTATCTC
GGGCTGGCCC ACGGAATCCG CGAGGAACAG GACGCGGCGG CGACCGTCTT CGCGGAATGC
GTGGCGTTGA CCGAGCGCTT CGGCGAATGC TGCTTCCGCT CGTTCGCTCT GTGCGGCCTC
GGTCTGGCGG CCTGGCAGCT AGGCGAGTTG GAGCAGGCGA TGGAGCACCA TGCCGCGGCC
ATCTCCCTCA AATACGCGTT CCACGACCAT GTCGGCATCG CCGTCGCGCT GGAGCAGCAG
GCGTGGGTCA TGGCCTCGGC CGGTCGGCAC CACGAGTCCG CGGTCCTGCT CGGCGCGGCC
GGGCGGATCT GGCGTGAGAC CGGGGCGTCG ATCGCCGTCT TCGGGCTCGC CTCGTTCCAC
CACCGATGCG AGGCGTCCCT GCGACGCGTC CTGGATCAAC GGAAGCTCGA ACGCGCCCTA
GCGCAGGGCG CGGAACTCGA CCTGGACGAG GCCGTCGCCG CGGCTACCCA GGGCACCGGG
ACTCCCGTCG CCCACCCCGT CGTTCCCGCC GGCGAGCCGG AGGTGCCCCT GACCCCTCGG
GAGCGTCAGG TGGCGGACCT CGTCGCCCGC GGTCTGAGCA ACAAGGAGAT CGCGAGCTAC
CTGGTGATCG CCCAGCGCAC GGCGGAAGGT CACGTCGAGC GCATTCTCGG CAAGCTCGGT
TTCACCTCAC GGGCCCAGAT CGTCGCGTGG GTTAAGGACA GCAGCAGGAG CCCCCAGGAA
CGTACGTGTG AGCCGCAGCG ACATGTTGGC GAGCGCGAGC GGAGCGCCTC CACCTCACCC
CCGCCTCCTC ATGGAGAGTC TGGAGGCGTA CAACCTCACG GGTAG
 
Protein sequence
MCVSGRGRSN LPAELTSFVG RRQELAEALR LLAGARLLTL TGPGGVGKTR LALRLAADRQ 
RAFPDGVWIA ELADLDDPSL VVDAVAEAMD LQFLSGRWTA ATLAAHLAPR RILLVLDNCE
HLVDTCARLV NTLLRAGPDL RVIATSRHLL GVAGERVLKV PPLRVPPVPA PGADQSALAG
GLIQYEAVSL LAERAAAVLG EFSLDEGNWR TVASLCHRLD GIPLAIELAA ARLRTLSLEQ
ILDRLDNRYA LLTRGNRAGP RRQQTLRALI DWSFGLCNPE EQQMWACLSV FADGFTLEAA
EQVEGARYPP AVVLDVLSGL VDKSILVRDD DEGMVRFRML DTIHQYGQDR LRESGREQDV
RRRHLAYYRR MAALADAQWC SPAQLDWIRR IRNEQANLRV ALDLCLTTPG LAETALSIMV
ALWQYWAAIG LLSEARFWFD RGLSKATEPD GIRAIGLQSA AHAASLQGDI TAAASLLAEA
RDLAARLHEQ RVLARIACVE GRLAAISGDT SAAVERSQDA RRRFTDLDEP LGLAQSLLYL
GLAHGIREEQ DAAATVFAEC VALTERFGEC CFRSFALCGL GLAAWQLGEL EQAMEHHAAA
ISLKYAFHDH VGIAVALEQQ AWVMASAGRH HESAVLLGAA GRIWRETGAS IAVFGLASFH
HRCEASLRRV LDQRKLERAL AQGAELDLDE AVAAATQGTG TPVAHPVVPA GEPEVPLTPR
ERQVADLVAR GLSNKEIASY LVIAQRTAEG HVERILGKLG FTSRAQIVAW VKDSSRSPQE
RTCEPQRHVG ERERSASTSP PPPHGESGGV QPHG
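
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool that produced this record: it assumes the codon assignments of translation table 11 (which match the standard code), treats a leading ATG, GTG, or TTG as an initiator read as M (only a subset of the start codons that table 11 allows), and stops at the first stop codon. The name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only these three initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring the spaces between 10-base blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGTGTGTGT CCGGACGTGG AAGGTCGAAC"))  # MCVSGRGRSN
```

The output matches the first ten residues of the protein sequence, which begins with M even though the gene starts with TTG rather than ATG.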