Gene Francci3_2597 details

Gene Information

Locus tag: Francci3_2597
Symbol:
ID: 3906503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3064426
End bp: 3066210
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 73%
IMG OID: 637879922
Product: putative PAS/PAC sensor protein
Protein accession: YP_481688
Protein GI: 86741288
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID: [TIGR00229] PAS domain S-box
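
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3064426, 3066210)
print(length, protein_length_aa(length))  # prints: 1785 594
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.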


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.158438
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAACCCGA CGCCGCAGCC CGATGGTCAG GTACGCAGCG ACATCTGCGG GCCTCGGCCC 
GGCCTGCCGG ACCCGCGGGC ACTCGCCCGG TGGGAAGAGG CGCTGCTGTG TCTGCGGGTG
TTCGACACGG CGCCGATCAT ATTCGCGGTC TTCGACCGCG GCCTGCGCTA CCGGGCGGTG
AACCAGGCGG CTGCCAGGGC CTCCGGCCTG TCCGCCGAGG AGATGATCGG TGGGTATGGC
CCGGAGGTGC TCCCGGGCAT AGACCCTCTC GGCTGGGAGC GGCCCCGCCG GGTGTTGGCG
ACCGGTGAGG CGATCCTCGA CGAGGAGATC ATCGGAGCGA CCCCGGCGGA CCCGTCGAAT
CCGCGCATCT GGCGGGCCTC CTACTATCCC CTGCATGACT CGACCGGGGC CAGAATCGCG
GTCGGCGTGA CGGCGGTGGA CGTGACCGAC GAGCGTCGCG CGCAGGCCGA ACGTGACCTG
GTACTGCGGC GGTTGCGGCT GCTGAGCCGG GCGAGCGGGC TGCTCGGCGC GTCGCTGGAC
CTGTCGGCCA CCTTGCGGGA GATGGTCACG CTGGTCGTCC CGGAATTCGC CGACGCCTGC
GAGCTGTACC TCGCCGACGA GCCCTGCCCC CCGGGGGCCG AACCCGACCC GCCGCTGCTT
CGCCGAGCCG TGTGGGCACA CTCCCCCGAC CTGCCCCGAC CATCCCCCGA CCTGCCCCTC
CCCCCACCAG ACACAAGAAG CAACCTGACC GGACGCCAGG TCGGACGGGT ATTCCTCACC
CGCCAACCGG TCCGCCTCGA CCTGGACGAA CGGCTCCCCG ACGCACCCGA CGCACCCGAC
GCACCCGCGA GCAGCGTGGC CGCCCTGTTC CGGCATTCCC ACCTTCGATC CGCGATCATC
GTCCCGCTGC TGGTCGGCCC GCACTGCCTG GGAACCGTCG GGTTCGCCGT CACCGCGGCC
CGCCCCTACC GCGAGCAGGA CACCCAGACC GCCACCGAAC TGGGCAGCCG GATCGCAACC
GCGATCGCGA ACGCCCACGC CTTCGACCAC CAGCGGACCG CCGCCCTGAC CCTGCAACGC
GCCCTGCTGC CCCGCGACAT CCCCACCCTG GACGACCTCG ACCTGGCCTG GCGCTACCAG
CCCGGCACCA GCGGCACCGA GGTCGGCGGC GACTGGTTCG ACGTCATCCC CCTGCCGGCC
GGACGGGTCG CGCTGGTCAT CGGCGACGTC ATGGGCCGCG GCCTGAACGC CGCCGCCGCC
ATGGGCCAGC TACGCACCGC CGCCCGCACC CTCGCCCGCC TCGACCTGCC CCCCGCCGCC
CTGCTGACCG AACTCGACGC CGTCACCCGC AGCATCGACA CCATCGCCAG CGTCGCCTAC
GTCATCCAGG ACCCGGCCAC CAGCACCCTG ACCGTGGCCA ACGGCGGCCA CCTCCCCCCC
GCCCTGCGCC ACCCCGACGG CACCGCCGAC CTCCTCGACG ACCCCCACGG GATCATCCTC
GGCGTCACCG AACAGACCTT CACCGAAACC CGCCACCGCT TCCCCCCCGG CTCCACCCTC
GCCCTCTACA CCGACGGCCT CGTCGAATCC CCCACCGTCG ACATCGGCGA GGGCTGCCGA
CGCCTGCTGC GCATCCTCAC CGAAACAGCC GACCTGCCCA CCACCGCGGA CCGGCTACTG
ACCCTCCTCA ACCGCAACGA CGGCAACGAC GGCTACGACG ACGACGTCAC CCTCCTCCTC
GCCCACGCCC GGCGGGGTCG GTGCGGCCGG TTCAGGGGGC GGTGA
 
Protein sequence
MNPTPQPDGQ VRSDICGPRP GLPDPRALAR WEEALLCLRV FDTAPIIFAV FDRGLRYRAV 
NQAAARASGL SAEEMIGGYG PEVLPGIDPL GWERPRRVLA TGEAILDEEI IGATPADPSN
PRIWRASYYP LHDSTGARIA VGVTAVDVTD ERRAQAERDL VLRRLRLLSR ASGLLGASLD
LSATLREMVT LVVPEFADAC ELYLADEPCP PGAEPDPPLL RRAVWAHSPD LPRPSPDLPL
PPPDTRSNLT GRQVGRVFLT RQPVRLDLDE RLPDAPDAPD APASSVAALF RHSHLRSAII
VPLLVGPHCL GTVGFAVTAA RPYREQDTQT ATELGSRIAT AIANAHAFDH QRTAALTLQR
ALLPRDIPTL DDLDLAWRYQ PGTSGTEVGG DWFDVIPLPA GRVALVIGDV MGRGLNAAAA
MGQLRTAART LARLDLPPAA LLTELDAVTR SIDTIASVAY VIQDPATSTL TVANGGHLPP
ALRHPDGTAD LLDDPHGIIL GVTEQTFTET RHRFPPGSTL ALYTDGLVES PTVDIGEGCR
RLLRILTETA DLPTTADRLL TLLNRNDGND GYDDDVTLLL AHARRGRCGR FRGR
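
The sequences above are printed in blocks of ten characters, so whitespace has to be stripped before any analysis. The sketch below shows one way to clean such a block, compute its GC fraction, and translate it. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code. Table 11 differs only in which codons may act as start codons, and this sketch does not handle that case: a CDS that begins with an alternative start codon such as GTG would need its first residue set to M separately. All function and variable names are hypothetical.

```python
def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# Codon table built in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = clean("ATGAACCCGA CGCCGCAGCC CGATGGTCAG")
print(translate(first_blocks))  # prints: MNPTPQPDGQ
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.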