Gene Francci3_3501 details

Gene Information

Locus tag: Francci3_3501
Symbol:
ID: 3905235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4174895
End bp: 4178572
Gene Length: 3678 bp
Protein Length: 1225 aa
Translation table: 11
GC content: 75%
IMG OID: 637880823
Product: putative PAS/PAC sensor protein
Protein accession: YP_482583
Protein GI: 86742183
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID: [TIGR00229] PAS domain S-box
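
The reported gene and protein lengths appear to follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record.
start_bp = 4174895
end_bp = 4178572

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count towards the length.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 3678 bp
print(protein_length, "aa")   # 1225 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.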


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.480314
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.404449
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCCCC CGGGATCCGC CTCCCCGTCA TCGGGCGTTG TTCCGTCCGG CCAGGGGGAT 
GTCTCCCCGG GACCGAATCC GGTCACCCGA TCCCAAGGGG GACCGTCGGC TGGCCCACAC
ACCAACCCCG CCCCACACAC CAACCCCGCC CCACGCGCGC ATCCCGGCCC GAACCAGGTC
CCCGGCCCGA AGCACGCTCC CGGCCCGGAG ACCGGCCCCG GTCAAGACCG GGGAGCCGAC
CCCGAGCTAC CCGGACTACC CGAGGGCTGG CCCATGGTCG CCCGGCCGGC CGCTTCGGCC
GTGACCGCCG AGCCGGTCAC GCGGGACGTC ATCGACGGCA CCGAGCCGCG CCGCGCGCCG
GAGGCGCCAG GGATGCGCCG GTCCGATCCG GCGGGGCTGC CGGGGCGTGT CCGGTCCGCT
TTCCCCCTGC CCGGACACGC CCCGGGAACC GACCCCGCGG CCACGTTCGT GATGCTGGAC
GCGCTGTTCG CGTGGGCCCC GGTCGGTCTC GCCCTGCTCG ACCGGGCCGG GCGGTTCCTG
CGGGTCAACG ACACACTTGC CCGGTTCGAC CGCCGGCCGG TCCACGAGCA CCTCGGCCGT
ACCGTCTCCG AGCTCCTCGG CGACACCGGC CAGGAACTCG ACGCTCTGCT GGCACGGGTA
CTGCGCACCG GCGAGCCGGT GGTGGACTTG GAGGTCATGG TCGCCACCGA TGGTTCCGGG
CCGCCGCAGA CGTGGCTCGC CAGCTGGTAT CCGGTGAACG ACCCTCAGGT CGGGCTGGTG
GGCGTGGCGT TCGTCGCGAT CGACGCCAGC GGGACGCGGG CGGTCGAGGG GGAGCGGGCG
CGGGCCGACG CACGTTACCT CGGCCTCGTG GACGCCGCCG CGGTGGACGT GTTCCACGCC
GAAGGCGACG GGGCACTCGA TGCCGATCTG CCCCGCCTGC GGGCCTTCAC CGGGCGGCAT
CCGGCGGAGC TGGCCGGGTT CGGCTGGCTC GGCGTGGTCC ACCCCGACGA CCGGGAGCGG
GTCGGGCGCG CCTGGCACGG GGCGATCGAG CACGGTGAGA CCTTCGAGGC CGAGTTCCGG
ATCTCCGGTG GCGGAGGCGA CCGCACCGCC ATGCGGGTGG TCGAGGCCCG CATCGTTCCC
ATGCCGGCGG CCGGCCGGCC GAACAGCCGG CCCAGCGAGT GGCTCGGGGT GATCCGCGAC
CTCACCGAGG TACGCGCCGC CGAGGCCGAC CGGGCCACGG CCGACCAGCG GGCCCGGATC
GCGACGGAAC GGGCCGAACA GACCGCGACG TTGGCCGTGG CGCTCGCCCG GACCCTGACC
GTGGACGACG TCGTCGCCAC CGTCCTCGAC GTCGGGGGGC GGATGGCCGG GGCCGCCGGC
CGGGGCGTCG CGCTCGTGGA CGAGGCGCAC GACCGGCTGC TCTTCCACGC CCCGCCGGGT
CCCGCCGACG GCCTCGCCCG CTGGTCGGAG GTGGCGCTGG GGGCGGTACA TCCGGTGGCG
GAGGTCATGC GGGGCGGTCG TGCGCTGTTT CTCGTCGACC GCGACGAGCT GCTCGCCCGC
TGGCCGGTGC CGGAGGTCGC CGACGTCGCC GCCGCGGCGG GTGAGCATGC CTGGGCGATG
CTGCCGCTGG CCGTCGGGGG CGGCGCCCCC TTCGGCGTGG TGACCTTCGG GTTCCGCCAG
GCGCGGGAGT TCACCGCCGC TGATCAGGCG TCCCTCATCG CGATCGCCGA CGCCTGCGCG
CAGGCCCTGG AACGGGCCAC CGGCTACGAA CAGCTCGCCG CCGACGCCGC GCGCGGTCAT
CGGACCCTGG CTGCGACGCG CGAGGCGCAG GCCGCCCTGG CACTCGCCGA TCGACGCCTT
CAGCTGCTGG GGCGAGCTAC CGGGATCGTG GCCGCGGCCG TGGAGCCTCC CGTCGCCCTG
CGCTCCCTGG CCGAGTTGAT CGTCTCGGAG GTCGCCGACC TGTGTGTTGT CCAGCTCGTC
ACTGGCACAC CCGCTCCCGT CCCGGCGTCC TGGAGCGCTG CCGCGGTGTC CGTCCGGGCC
GGGGACGCGG CCGGGGATAA GGCCGGGGAC GCGGCCGGGG ATAAGGCCGG GGCCGAGGAG
ACGGCGGGGG ACCGGGCGGC AGAGCCGGTG CCCGAGCTGC GTCCACTCGT CGTCCTGGCC
CGTGACGGGC TTGGCACGGT GCCTCCGTTC GCCTCGGGGG CCGGCGCGGC GACGTCGCCG
GCGAGTCCGT TCGCCCGGGC CGCCCGCCGG GGCGAACGGC TGATCGTCGC ACTGACAGCG
GGCGAGTGGG ATCCGCCGGC CGACGCCGAG CGGTGGATCC GCCAGGTGGG GGCCCACACG
ATGGCTGTCG TACCCGTGGT ACGGGTCGGC CACGTCGTCG CCGTACTGAG TGTGACCGCC
GTTGCGGATC GACCTCCGTT CACCGAAGCG GATCTGCTCT TGCTGACCGA ACTCGCCGCC
CGGGTGGGGG TCGTCCTGGA CCGGATCGAT CGGGGGGCCG CCGAGCGCAG CAATGCGCTG
GCACTGCGCG AGGCGTTGCG CGGTTCTCCA CCCGCCGTCC CGTCCGGGCT CGAGGTGGCC
ACCCGTTACC TGCCCGGCGG GGTCGATGAC GACGCCGGTA GCGACTGGTT CGACGTGATT
GACCTGGGAG CCGGGCGAGT CGCCCTGATG ATCGGCAACG TGATGGGCCG AGGGATCCGG
GCGACCGCGG TGATGGGGCA GCTGCGCGCG GCGGCCCGCA CCTGCGCCCG TCTCGATCTT
CCCCCGGCCG AGGTGCTGAC GCTGCTGGAC GGCATCGTCG CGGACCTGCC CGGGGAGGAG
ATCGCCACCT GCATCTACGC GGTCGTTGAG ATCGACAGTG GGGTGCTGAC GCTGGCGAGT
GCCGGGCACC CGCCGCCCCT GGTCGTCGCG CCGGACGGGT TGGTCTCCAG GCTCTACATG
GCGGTGGGAT CTCCACTCGG GGTGGCCCGG TCGGACGTGA CCGAGTACAC GGTGCGACTG
GGACGGGGAT ATCTGATCGC CCTGTTCACC GACGGGCTCG TCCGGGGACG TGCGCGCGAC
CTCGACGCCG GGGTCTCGCA GCTCGCGGCC GCGCTCGCCC GCGCCAGCGA CAGGTTCACC
GCGAATCTGG ACGACCTGGT CACCACGGCG TGTGCCGGTC TCGGTCCCGC CGTCGCCCCC
GGCCCAGTGG GTTCCGGGGC GGCCGACGGG GCGGCCGACG AGGTGGCGGC CGATGACGTC
GCGCTTCTGT TCGCCCGGTT GCCCGTTGAA CCGACGGCCG CGGCGGCCCT CCTGGACGTC
ACCTTCGACG GTGCGGCGAG CCTGCGCGCC GTGCGGGCTC AGGCCAGGCT CGCGCTGGAG
AACGCGCCGC TGGCCTCGGA AGTCGTCGAC ACCATCGTTC TGGTGCTGTC GGAGCTGGCG
AGCAACGCGG TGCGGCACGG TCGCCCACCA CTGTCGGTGC GGCTGCGGCT GCTGGGCGAC
CGGGCCGTCG TCGAGGTCGC CGACGGTGGC GGCCGGGTGC TGCGGCGGCG CCACGCCGCG
GCCGAGGATG AGGCCGGCCG CGGGCTCGGC CTGGTCTCCC AGCTCGCCGT TCGGCATGGC
GTCCGTCCGG TCCCCGACGG GAAGGCCGTG TGGGCCGAGA TCGACCTGAC CGGGACGACC
CCGCCCGAAC CGGACTGA
 
Protein sequence
MSPPGSASPS SGVVPSGQGD VSPGPNPVTR SQGGPSAGPH TNPAPHTNPA PRAHPGPNQV 
PGPKHAPGPE TGPGQDRGAD PELPGLPEGW PMVARPAASA VTAEPVTRDV IDGTEPRRAP
EAPGMRRSDP AGLPGRVRSA FPLPGHAPGT DPAATFVMLD ALFAWAPVGL ALLDRAGRFL
RVNDTLARFD RRPVHEHLGR TVSELLGDTG QELDALLARV LRTGEPVVDL EVMVATDGSG
PPQTWLASWY PVNDPQVGLV GVAFVAIDAS GTRAVEGERA RADARYLGLV DAAAVDVFHA
EGDGALDADL PRLRAFTGRH PAELAGFGWL GVVHPDDRER VGRAWHGAIE HGETFEAEFR
ISGGGGDRTA MRVVEARIVP MPAAGRPNSR PSEWLGVIRD LTEVRAAEAD RATADQRARI
ATERAEQTAT LAVALARTLT VDDVVATVLD VGGRMAGAAG RGVALVDEAH DRLLFHAPPG
PADGLARWSE VALGAVHPVA EVMRGGRALF LVDRDELLAR WPVPEVADVA AAAGEHAWAM
LPLAVGGGAP FGVVTFGFRQ AREFTAADQA SLIAIADACA QALERATGYE QLAADAARGH
RTLAATREAQ AALALADRRL QLLGRATGIV AAAVEPPVAL RSLAELIVSE VADLCVVQLV
TGTPAPVPAS WSAAAVSVRA GDAAGDKAGD AAGDKAGAEE TAGDRAAEPV PELRPLVVLA
RDGLGTVPPF ASGAGAATSP ASPFARAARR GERLIVALTA GEWDPPADAE RWIRQVGAHT
MAVVPVVRVG HVVAVLSVTA VADRPPFTEA DLLLLTELAA RVGVVLDRID RGAAERSNAL
ALREALRGSP PAVPSGLEVA TRYLPGGVDD DAGSDWFDVI DLGAGRVALM IGNVMGRGIR
ATAVMGQLRA AARTCARLDL PPAEVLTLLD GIVADLPGEE IATCIYAVVE IDSGVLTLAS
AGHPPPLVVA PDGLVSRLYM AVGSPLGVAR SDVTEYTVRL GRGYLIALFT DGLVRGRARD
LDAGVSQLAA ALARASDRFT ANLDDLVTTA CAGLGPAVAP GPVGSGAADG AADEVAADDV
ALLFARLPVE PTAAAALLDV TFDGAASLRA VRAQARLALE NAPLASEVVD TIVLVLSELA
SNAVRHGRPP LSVRLRLLGD RAVVEVADGG GRVLRRRHAA AEDEAGRGLG LVSQLAVRHG
VRPVPDGKAV WAEIDLTGTT PPEPD
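
The protein sequence can be related to the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code), in which an initiating GTG codon is read as methionine. The following is a minimal Python sketch of that relationship, not the database's own pipeline. The function name `translate_cds` and its parameter are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the NCBI base order (T, C, A, G). Translation table 11
# uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Alternative start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna, first_codon_is_start=True):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Hypothetical helper for illustration. Whitespace in the input (such as
    the 10-base blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiating codon is translated as methionine.
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (starts with GTG).
n_terminus = translate_cds("GTGAGCCCCC CGGGATCCGC CTCCCCGTCA")

# Last 18 bases of the gene sequence (ends with the TGA stop codon).
c_terminus = translate_cds("CCGCCCGAAC CGGACTGA", first_codon_is_start=False)

print(n_terminus)  # MSPPGSASPS
print(c_terminus)  # PPEPD
```

The two fragments match the first ten and last five residues of the protein sequence listed above.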