Gene Francci3_1176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1176 
Symbol 
ID3905287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1404171 
End bp1406099 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content71% 
IMG OID637878508 
Productstage II sporulation E 
Protein accessionYP_480284 
Protein GI86739884 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.95059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGAGA CGGCTGGCCT TCGGCGTGGC ATCCGGGTGC TGACGCTGCT CGCCAGCATC 
TCGATCATCG TGATGCTGGT GTTGGCCGTG GTCTCCGCCC TGCGGTCACG TCATGCCGCG
ACGGAGCGGA GCAGGCATCT GGACCCGGCT GCGACCACCA CCGCGATGCT CCTGGCCGAC
TTTGTCGACC AGGAGAATGC TCTGCGTGGC TACATCATCA CCCGGGACCG GGGTTTCCTG
GTCCCCTACA ACGAGTCCGC CAGGTCCATC CCGGTCCTGA CGGCACGGCT GGACGCATTG
CTGGCCGACT TCCCTGCGCT GCGCGCACAG CACGGGGAAG TCGGCCAGGC GTACCGGGAC
TGGCGTCGCG AGGTTGTCCG GCCGGAACTG GTCGCGATGG CGAGAGGAGA CACCGCCACC
GCCCAGGATA TCGTGCGCAC CAAAGCGCGC CAGGACTTCG ACCTGCTGCG CCGCGAGGTC
GCGGAGCTCG CCGCGGCGAT CGACCGTGAG CAGGTCGAGG CGTCCGGGCG GGTGGAGAGT
GCCTCCGTGC TGCTGCTGAG CTCGCTGGCC AGCGCGATGT TTGTCATCCT CGGCTTCCTG
CTGACCATTA TGATCATCTC GCGGCGATGG CTGCTCCGCC CGATCGGAGC GTTGCAGCGA
TCCGTCAACG CGGTGGCCGC CGGTCGCTAC GACACCCGGA TCCCCTCCGT CGGCCCCAAG
GAGATCGTGG AGCTGGCCGC CGACGTCGAG ACGATGCGCG CCCAGCTGGT GCGGCTCGTC
CGCCAGAACG AGCGGTCGTG GGAGGCCCTG GCCCAGCAGG GACCCGCCGT GATCGCACTG
CGGGACGCGC TCACGCCCTC GCTCCTGCGG GCCCGCGGCC TGGTCCTGCA CGGCCGGGTC
GATCCGGCCG AGGGCGAACT TGCCGGGGAC TGGTACGACG CCTTCGAGCT GCCCGACGGG
CGGGTCGCCG TCGTGGTCGG TGACGTCTCC GGTCATGGAG CCGCGGCCGG GGTCTTCGCG
TTGCGGCTCA AGCAACTGCT CGACGCGGCT CTGTCGACGG AGATCGATCC CGGTCGGGCC
CTGGAGTGGA CGGTGGACAA CCTCGGCGAG ATCGAGGAGA TGTTCGCGAC CGCGATCATT
GCCGTCGTCG ACCCGCGGAC CGGGGACCTG TACTACGTCA ACGCCGGTCA TCCCGACGCG
CTCCTGCTGC GCCGGGCGGT GCCCGGGAAC TCCGGGAACA TGGCGTCGGC GGGCGAGTCG
CCCGTGGGCG AGGTGTGGGT GGGTGAGCTG CCCACAGGCA AGGTACCTGC GGACGCGGTG
CTTCCCGACG AGGCATCCTC ACCCCAGGTA CCCGCTCCGA CCGCTCACCT GGTCGGACCC
CGCGATCCCG CCCGGCCGGG ATGCGGCCCC GGTGACGGGG CGGGCGGATC CGTCATGGCG
GCTGCTGTTG GATCCGCGCA GGCCGTCGGA TCCGCGCAGG CCGTCGGATC GGCCTGCGGG
CCGGGGGATG GGGCGAACGG TGCCGGCCCC GGGTCCGGTC GGGTGCCGGG AGCCGGTTCC
ATCGTCGGTT CCCGGCTGGG AACGGCGCAG GTCGTTCGGC TGCCTCCGAC GGGCCCGTTG
ATGTCGAGCC TGCTCGCGGA GCCCGGGGCG TGGGGAATCC AGACGCTGCG GCTCGAACCG
GGCGACGTGC TCTTCGCCTA CACCGACGGA CTGGTGGAGG CACGTGACGA GGCCGGGCGG
CAGTTCGGGC TGCACCGGCT GATCGCCGAG ATCCTGCGCG ATCCCACGCG CACGCCGGTG
GCGCTGCTCG ACGACGCCTT CGACGCGGTT CGCCGGTACG CGCCCGGACG GCAGGGCGAC
GACCGTACGG CGATCATCCT CGCCCGTACC GCCCAGCGTG GCTCCACGAT CGCGCCCGGA
GAGTCATAG
 
Protein sequence
MGETAGLRRG IRVLTLLASI SIIVMLVLAV VSALRSRHAA TERSRHLDPA ATTTAMLLAD 
FVDQENALRG YIITRDRGFL VPYNESARSI PVLTARLDAL LADFPALRAQ HGEVGQAYRD
WRREVVRPEL VAMARGDTAT AQDIVRTKAR QDFDLLRREV AELAAAIDRE QVEASGRVES
ASVLLLSSLA SAMFVILGFL LTIMIISRRW LLRPIGALQR SVNAVAAGRY DTRIPSVGPK
EIVELAADVE TMRAQLVRLV RQNERSWEAL AQQGPAVIAL RDALTPSLLR ARGLVLHGRV
DPAEGELAGD WYDAFELPDG RVAVVVGDVS GHGAAAGVFA LRLKQLLDAA LSTEIDPGRA
LEWTVDNLGE IEEMFATAII AVVDPRTGDL YYVNAGHPDA LLLRRAVPGN SGNMASAGES
PVGEVWVGEL PTGKVPADAV LPDEASSPQV PAPTAHLVGP RDPARPGCGP GDGAGGSVMA
AAVGSAQAVG SAQAVGSACG PGDGANGAGP GSGRVPGAGS IVGSRLGTAQ VVRLPPTGPL
MSSLLAEPGA WGIQTLRLEP GDVLFAYTDG LVEARDEAGR QFGLHRLIAE ILRDPTRTPV
ALLDDAFDAV RRYAPGRQGD DRTAIILART AQRGSTIAPG ES