Gene Francci3_0158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0158 
Symbol 
ID3903089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp185247 
End bp188675 
Gene Length3429 bp 
Protein Length1142 aa 
Translation table11 
GC content67% 
IMG OID637877490 
Producthypothetical protein 
Protein accessionYP_479279 
Protein GI86738879 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGACG ATCTTCTTGC CAGGGTCCAG GGGCGACTGG AGCTGTTCGC CGCCGAAAGC 
TCCCCCGAGA TCGTGTTGGC CAACGATGCG GTGTTGGAGG TGATGGGTCT CTTGAACGCC
GTCCCGGATC CGTCCATCGA CACGGATGTT CTGCTCGCGG CCGGTCTTCT GTACTGGTGT
CGTTATCTTG TGCTGGGAGG TGACGATGGC CGGCCGGACC TGGCGCGGGC ACGGGAACTG
CTGGCATCGG TCCACCGGTC GAATCCGACC CTTCTACCCA GCGAGATCCG TGACGCCCTT
GACGAGGCTG ACATGAACCC GAGCAGCCGG GAAAGGTTGG CACTTCGCGC CGAACACCTG
TACGGCCAGT CGACGAAAAC CGGTGACGTC GACGGTCTCA GCGAGGCGAT CGCCCTGTTC
CGACGCGCGT CTGCTGTTAC GCCCGTGAGT CATCCGCTTC ACGTCGGGCT CCTTTCCAAT
CTCGCGACGG CGTTACAGAC GCGGTTCGCC TGGACGGGTT CGGACTCCGA CATCGACGAG
GCGGTCGACC TCGGCCGGCT GGCCGCCACC CGGGCACCGG AGAACCACCC TTCCCGCTTC
CTGGTGCTGT CGGGTCTCAG TGGTTCGCTA TGGGCGCGAT GTGTGCACCG GAAGTCGCCG
GCGGACCTGG AGGAATCCCT GCAGGCAATC CGGCAGGCGG TCGCGGTCAT CCCGCCACAA
GATCCTAATA GCGGCCGTTA TCTGTCAAAC CTAAGTAACA TTCTGCGTTC CCGGTTCGAG
TGGACGGGCG CGCGAGCCGA TCTCGACGAG GCAGTGGAAC AGGGTCGACG CGCCGTCGAT
GTGACGCCGG CCCACCATCC TCAGTACGCG ACGATGCTCA CCAATCTGGC CGTCGCGCTG
CAGACGCGTT TCGTGCAGGC CGGGGTCTCG ACGGATCTAA CGGCCGCCAT CGATATCTTT
GGCCGCGCGG CCACAGTAAC GCCTCCCAGC CATCCGCACT TTCCGGTCGT CTTGGGAAAC
TTGAGTGCTG CGCTGCTGGT CCGGGCGCTG CACACCGGGA CGGACTCCGA CCTCACCACG
GCGGTGGAGA CGGCCCGGCG GGCAGTGGCT GTGACTCCTC CCGGCAGCCC GGACCGTGCC
CGGCGTCTCT CCAATCTGGG CAACATTCTC CGGGCACGTT TTGATCGGGT CGGGCTGCTG
GTTGACCTGG ATGAGGCTGT GGAGGTCTGC CGGCAGGCGG TAGTGGCGAC GCCGGCCAGC
CACGCGGAAC GGGCCGTGAT ACTGACCAAT CTTGGGGCCG TGGTGGGTCT GCGCGCGGAT
CGAATCGGGC GCGCGGCCGA CCTGGACGAG GCAGTCACCG TCGGCCGGCA GGCGGCGGCT
GCCACCTCGA CGGAGCACAC GGCTTGGGTC CCCGTCATGG TGAACCTCTG CAGGGCGCTG
TCACGACGCG CCCGGCTGGC TGGTACCTCC GCAGATCTGG ACGAGGCGGT GGAAACTGCC
CGCGCGGCGC TCGCCGCCGC CGAGGCCAAA GAGAACAGAG CCTTTGTCGC GGCCGCGGCG
TCGAATCTCG GCGAGACCCT CCATCTCCGC TTTGACCGGA CAGAAGACAT GCCCGACCTG
GACGGTTCCG TGGCGGCGTA CCGGATGGCG GTCGATGCTC GCGGCGATGA CCCGGATGCT
GCCACTTCCC TGTCCGGCCT CGGTCTTAGT TTGTGGAACC GTTTCGAACA CACTGGGAGG
CCGGCGGATC GGGATGTCAG TATCGCCGTG TTTCGACGTG CCGCCGCGCT GGCAACGGCA
GCACCCAGCA TCCGAGCGAA GGCGGCGGGA GCATGGGCGA GCCTGGCGGC CACCGCGGGC
GACTGGCAGC AGGCTGTCGC CGGCTACAGC ACCGCTGTGG ACCTGCTGGG ACAGGTCGCG
CCGCGCAGCC TTGACCGCGA AGACCAGGAG TATCGTCTCG TCACCCTGTC CCGGCTGGGA
TCGCAGGCCG CCGCAGCGTG TTTACAGGTC GGCAAGGTCG AACGCGCCCT GGAACTCTGG
GAGCAAGGAC GCGGAGTCAT CCTCGGCCAG ATTCTCGACG CCCGTACGGA CCTCGCCCTC
CTTGCCGCAA GAGACCCGGA GAAGGCCGCG CTGTTCAGGC GGCTTCAGGA TGAGTTCGAC
GCTCCTCCCG CCTTCGACGG ATCCGACATA CCGCTGGCGG AGCAGGCATC CCCGCCATTC
GTCGGAAACG ACGCGACGGG CACGGCACGA CGCGGTGCCG ACCGGCGGCA TGCCCGCGCG
GCGCGGTTCG CAAGCCTCGT CTCCGAGATC CGCAGCCTGC CGGACTTCGA GCGTTTTCTT
CTGCCACCAA CCATCGACGA TCTTCGGCAC GCCGCGAGCC AGGGGCCGAT CGTGGCCGTC
AACGTCAGCG AGATCCGTTG CGATGCCCTG ATCCTGACCA CGGCCGGCGT GCAGCTCCTA
CCCCTGCCGG ACCTCACTGA AGAAGCCGTT GGTGACCAGG TCCTCGCTTT CCTGACGGCT
GTCGAACGCG GCGACGAGAA GGGGCTCTCG AACGTCTTCG GCTGGCTGTG GGACGTCCTC
GCCGGACCGG TGTTGGAACG CCTGGACATC CACGGACCAC CGGCAACGGG CACCTCGTGG
CCGCGGATGT GGTGGTGCCT GTCGGGGCTG TTGTCGTTCC TTCCCGTGCA CGCGGCAGGC
CATCACCAGG CTCGATTCGA TCCGGCGCCG GACACGCTGA TCGATAGAGT GATCTGTTCG
TACACTCCAA CGATCCGCGC CTTGGGCCAT GCTCGGCGCA CCGCGCCCGA CGCGGCGACG
CTTGTCGGCC TTCCGTCAGC GAACGACGAG GGCAGGCGCG CGCTGGTCGT GGTGATGCCC
CACACCCCCG ACGCCGGCGA CCTGCCCGGC GCACACCTCG AAGCCGCCAT CCTCACGCGG
ATTCTGCACG AACGGGTGAG CACGCTGGTC CAGGACAAAG CGACCCGCGC TGCGGTGCTG
GCCGCCCTGC CGCAGGCACG CTGGGTGCAC TTCGCCTGCC ATGGCGAGGC AGCCATCTCC
GCCCCATCGA CCAGCCGTCT GCTACTACAC GACCAGCCCC TCACCGTTCT GGACGTCAAC
CGCCTCAGGC TCACCGACGC CGAACTTGCA TACCTGTCCG CGTGCGAAAC CGCCCGCCCG
GGCGGCGAGC TTTCCGACGA GGCGATGCAC CTCGCCTCCG CCTTCCAGCT CGCCGGCTAT
CGGCATGTCA TCGCCACGCT GTGGCCCATC AACGACCAGA TCGCCGTCGA CCTCGCGGAA
AATATCTACA GGTTCCTAGC CGACGGTAGC GACGTGGCCG CGGCCGTTCA CAACGCCACC
CGTGCCCAGC GCAACTACGC GCCGCGATCC CCGTCACAGT GGGCGTCCCA CATCCACGTC
GGCGCCTGA
 
Protein sequence
MRDDLLARVQ GRLELFAAES SPEIVLANDA VLEVMGLLNA VPDPSIDTDV LLAAGLLYWC 
RYLVLGGDDG RPDLARAREL LASVHRSNPT LLPSEIRDAL DEADMNPSSR ERLALRAEHL
YGQSTKTGDV DGLSEAIALF RRASAVTPVS HPLHVGLLSN LATALQTRFA WTGSDSDIDE
AVDLGRLAAT RAPENHPSRF LVLSGLSGSL WARCVHRKSP ADLEESLQAI RQAVAVIPPQ
DPNSGRYLSN LSNILRSRFE WTGARADLDE AVEQGRRAVD VTPAHHPQYA TMLTNLAVAL
QTRFVQAGVS TDLTAAIDIF GRAATVTPPS HPHFPVVLGN LSAALLVRAL HTGTDSDLTT
AVETARRAVA VTPPGSPDRA RRLSNLGNIL RARFDRVGLL VDLDEAVEVC RQAVVATPAS
HAERAVILTN LGAVVGLRAD RIGRAADLDE AVTVGRQAAA ATSTEHTAWV PVMVNLCRAL
SRRARLAGTS ADLDEAVETA RAALAAAEAK ENRAFVAAAA SNLGETLHLR FDRTEDMPDL
DGSVAAYRMA VDARGDDPDA ATSLSGLGLS LWNRFEHTGR PADRDVSIAV FRRAAALATA
APSIRAKAAG AWASLAATAG DWQQAVAGYS TAVDLLGQVA PRSLDREDQE YRLVTLSRLG
SQAAAACLQV GKVERALELW EQGRGVILGQ ILDARTDLAL LAARDPEKAA LFRRLQDEFD
APPAFDGSDI PLAEQASPPF VGNDATGTAR RGADRRHARA ARFASLVSEI RSLPDFERFL
LPPTIDDLRH AASQGPIVAV NVSEIRCDAL ILTTAGVQLL PLPDLTEEAV GDQVLAFLTA
VERGDEKGLS NVFGWLWDVL AGPVLERLDI HGPPATGTSW PRMWWCLSGL LSFLPVHAAG
HHQARFDPAP DTLIDRVICS YTPTIRALGH ARRTAPDAAT LVGLPSANDE GRRALVVVMP
HTPDAGDLPG AHLEAAILTR ILHERVSTLV QDKATRAAVL AALPQARWVH FACHGEAAIS
APSTSRLLLH DQPLTVLDVN RLRLTDAELA YLSACETARP GGELSDEAMH LASAFQLAGY
RHVIATLWPI NDQIAVDLAE NIYRFLADGS DVAAAVHNAT RAQRNYAPRS PSQWASHIHV
GA