Gene Francci3_1556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1556 
Symbol 
ID3904788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1865641 
End bp1866732 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content70% 
IMG OID637878893 
Productserine phosphatase 
Protein accessionYP_480661 
Protein GI86740261 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.79241 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGCGCT GGTTCGCCCC GATCGCCCCG ATCACCCCGA TGCTCATACT CGCCCTGGTG 
GTCGGACTGC AACTGAGCAA CCAGAAATGG AACGTGATCG GACTCGCGAT TCTGAGCCCC
ATGCTCGCGG CCACGTTCGC CGGACCCCGG CTGACCGCAG GCTACGGGGT GGCGGCGGTC
CTGGCGGGCA TCCTGCTGGG TCTGCACGAT GACCTGTTCG GCCGGAGCGG CGGTGGCCCG
ACGGCCCAGG TGGTCCGGCT GGTCGGGGTC ACGGCCGGCG GAGTGATGGC CGTCCTGGTC
AGCCGATACA ACATCCGACG CGAGACGAAG CTGCAGAACG TCACTCGGGT GGCCGAGGTG
GCGCAGCAGA CGATCCTCAG TCCCGTCCCG TCGTCGTCCG GCGGGCTGCG GTTCGCCGTC
CGCTACGAGA GTGCCACCGT GGAGGCCATG ATCGGTGGGG ATCTGTACGA GGTCGTCGAC
AGCCCGTGGG GAACCCGCCT GCTCATTGGT GACGTACGGG GCAAGGGGCT CGACGCCGTG
CGGATCGCGA GCCGGGTGCT CAGCTGCTTC CGGCTGATGA GTCGACGCAC GGGCGGCCTG
CGCGATCTGC TGGCGAACCT CGACGCGGAG GTCGCCGATG CCAGCTGCCT GGACGACTTC
GTCACCGCGG TCGTCGGGCA GGTCGACGGC AGCCGTCTGA CGCTGGCGAA CGCGGGACAT
CCCGATCCCG TTCTCGTTCG TGCCGGGCAG GCGGATCTGC TCACTGTCTC GTCCAGGCTG
CCACCGCTCG GGCTGATCAC GGACGGGAGC AACGTGACGG ACACCGTGCT GCGGGCGGGG
GATCGCCTGC TGTTCTATAC CGACGGCATC ACCGAGGCAC GCGCCCCCAC GACCGGCGCC
TTCTTCCCCC TGCTGCCCGC GGCCGAGGCC GCGTTCGCCC ACACGTCACT CGACGAGGCG
CTGACCGATC TCGCCGACCG GGTCCGGGAC TGGACGCGAT CGACGCTGAA CGACGATGTG
GCGCTCCTGG CGGTCGAGGT TCCCGGACCG ACCCGGCACG CCGGCCCAAC CCGACGATCG
GATGATCACT GA
 
Protein sequence
MLRWFAPIAP ITPMLILALV VGLQLSNQKW NVIGLAILSP MLAATFAGPR LTAGYGVAAV 
LAGILLGLHD DLFGRSGGGP TAQVVRLVGV TAGGVMAVLV SRYNIRRETK LQNVTRVAEV
AQQTILSPVP SSSGGLRFAV RYESATVEAM IGGDLYEVVD SPWGTRLLIG DVRGKGLDAV
RIASRVLSCF RLMSRRTGGL RDLLANLDAE VADASCLDDF VTAVVGQVDG SRLTLANAGH
PDPVLVRAGQ ADLLTVSSRL PPLGLITDGS NVTDTVLRAG DRLLFYTDGI TEARAPTTGA
FFPLLPAAEA AFAHTSLDEA LTDLADRVRD WTRSTLNDDV ALLAVEVPGP TRHAGPTRRS
DDH