Gene Francci3_3764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3764 
Symbol 
ID3906048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4513251 
End bp4515182 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content73% 
IMG OID637881090 
Productputative PAS/PAC sensor protein 
Protein accessionYP_482844 
Protein GI86742444 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.124369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.415801 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCGA CGCCGCAGCC CGATGGTCAG GTACGCAGCG ACATCTGCGG GCCTCGGCCC 
GGCCTGCCGG ACCCGCCGGC ACTCGCCCGG TGGGAAGAGG CGCTGCTGTG TCTGCGGGTG
TTCGACACGG CGCCGATCAT ATTCGCGGTC TTCGACCGCG GCCTGCGCTA CCGGGCGGTG
AACCAGGCGG CTGCCAGGGC CTCCGGCCTG TCCGCCGAGG AGATGATCGG TGGGTATGGC
CCGGAGGTGC TCCCGGGCAT AGACCCTCTC GGCTGGGAGC GGCTCCGCCG GGTGTTGGCG
ACCGGTGAGG CGATCCTCGA CGAGGAGATC ATCGGAGCGA CCCCGGCGGA CCCGTCGAAT
CCGCGCATCT GGCGGGCCTC CTACTATCCC CTGCATGACT CGACCGGGGC CAGAATCGCG
GTCGGCGTGA CGGCGGTGGA CGTGACCGAC GAGCGTCGCG CGCAGGCCGA ACGTGACCTG
GTACTGCGGC GGTTGCGGCT GCTGAGCCGG GCGAGCGGGC TGCTCGGCGC GTCGCTGGAC
CTGTCGGCCA CCTTGCGGGA GATGGTCACG CTGGTCGTCC CGGAATTCGC CGACGCCTGC
GAGCTGTACC TCGCCGACGA GCCCTGCCCC CCGGGGGCCG AACCCGACCC GCCGCTGCTT
CGCCGAGCCG TGTGGGCACA CTCCCCCGAC CTGCCCCGAC CATCCCCCGA CCTGCCCCTC
CCCCCACCAG ACACAAGAAG CAACCTGACC GGACGCCAGG TCGGACGGGT ATTCCTCACC
CGCCAACCGG TCCGCCTCGA CCTGGACGAA CGGCTCCCCG ACGCACCCGA CGCACCCGAC
GCACCCGACG CACCCGACGC ACCCGACGCA CCCGACGCAC CCGACGCACC CGACGCACCC
GACGCACCCG ACGCACCCGC GAGCAGCGTG GCCGCCCTGT TCCGGCATTC CCACCTTCGA
TCCGCGATCA TCGTCCCGCT GCTGGTCGGC CCGCACTGCC TGGGAACCGT CGGGTTCGCC
GTCACCGCGG CCCGCCCCTA CCGCGAGCAG GACACCCAGA CCGCCACCGA ACTGGGCAGC
CGGATCGCAA CCGCGATCGC GAACGCCCAC GCCTTCGACC ACCAGCGGAC CGCCGCCCTG
ACCCTGCAAC GCGCCCTGCT GCCCCGCGAC ATCCCCACCC TGGACGACCT CGACCTGGCC
TGGCGCTACC AGCCCGGCAC CAGCGGCACC GAGGTCGGCG GCGACTGGTT CGACGTCATC
CCCCTGCCGG CCGGACGGGT CGCGCTGGTC ATCGGCGACG TCATGGGCCG CGGCCTGAAC
GCCGCCGCCG CCATGGGCCA GCTACGCACC GCCGCCCGCA CCCTCGCCCG CCTCGACCTG
CCCCCCGCCG CCCTGCTGAC CGAACTCGAC GCCGTCACCC GCAGCATCGA CACCATCGCC
AGCGTCGCCT ACGTCATCCA GGACCCGGCC ACCAGCACCC TGACCGTGGC CAACGGCGGC
CACCTCCCCC CCGCCCTGCG CCACCCCGAC GGCACCGCCG ACCTCCTCGA CGACCCCCAC
GGGATCATCC TCGGCGTCAC CGAACAGACC TTCACCGAAA CCCGCCACCG CTTCCCCCCC
GGCTCCACCC TCGCCCTCTA CACCGACGGC CTCGTCGAAT CCCCCACCGT CGACATCGGC
GAGGGCTGCC GACGCCTGCT GCGCATCCTC ACCGAAACAG CCGACCTGCC CACCACCGCG
GACCGGCTAC TGACCCTCCT CAACCGCAAC GACGGCAACG ACGGCTACGA CGACGACGTC
ACCCTCCTCC TCGCCCACGC CCGGCGGGGT CGGCGAGCTC AGACCGAGAG CAGCCCATGC
TCCGCGCACC GGGCCGACCA ACCCACCGGG GTGACCTGGA CGACCATCCG CCGCCGGCAG
GCTCGACAGT AG
 
Protein sequence
MNPTPQPDGQ VRSDICGPRP GLPDPPALAR WEEALLCLRV FDTAPIIFAV FDRGLRYRAV 
NQAAARASGL SAEEMIGGYG PEVLPGIDPL GWERLRRVLA TGEAILDEEI IGATPADPSN
PRIWRASYYP LHDSTGARIA VGVTAVDVTD ERRAQAERDL VLRRLRLLSR ASGLLGASLD
LSATLREMVT LVVPEFADAC ELYLADEPCP PGAEPDPPLL RRAVWAHSPD LPRPSPDLPL
PPPDTRSNLT GRQVGRVFLT RQPVRLDLDE RLPDAPDAPD APDAPDAPDA PDAPDAPDAP
DAPDAPASSV AALFRHSHLR SAIIVPLLVG PHCLGTVGFA VTAARPYREQ DTQTATELGS
RIATAIANAH AFDHQRTAAL TLQRALLPRD IPTLDDLDLA WRYQPGTSGT EVGGDWFDVI
PLPAGRVALV IGDVMGRGLN AAAAMGQLRT AARTLARLDL PPAALLTELD AVTRSIDTIA
SVAYVIQDPA TSTLTVANGG HLPPALRHPD GTADLLDDPH GIILGVTEQT FTETRHRFPP
GSTLALYTDG LVESPTVDIG EGCRRLLRIL TETADLPTTA DRLLTLLNRN DGNDGYDDDV
TLLLAHARRG RRAQTESSPC SAHRADQPTG VTWTTIRRRQ ARQ