Gene Francci3_0163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0163 
Symbol 
ID3903094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp192225 
End bp193595 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content70% 
IMG OID637877495 
Producthypothetical protein 
Protein accessionYP_479284 
Protein GI86738884 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02276] 40-residue YVTN family beta-propeller repeat 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACCA GGGCGAAATC GCATCAACCC CGGCCCGGGA AGCGGCCCCG GCCGTACGGC 
CCGATCCTGC TGGCCGGCCT GGTGGTCCTT GTCGTCGCCC TGCTGGCGGG CTGGGCCCTC
CGTGACCCGG GCAACCAGGC TGCGAACGCT GGAGCCCTCC CGCAACCGCC TGCGTCCCCG
GGCGCGTCCC CGGGCGCGTC CCCGGGCGCG TCCCCGGCCG CGTCCCCGGC CAGGGCGGGA
GCAAGCCCCA CCGCCGCCGT TGCTGGTGGT ACCACCGTTG CTGGTGCTAC CACCACTGCT
GTGGATGTGT ACGCCCACGC CCATGCCGGG ATGCTGAGCC CGGTCGTGCG CGACGATCCG
CAGCTCGTCT ATGTCCCGAA CCTGTCCGAC GGCACGGTCA GCGTCATCGA CCAGCACACG
CTGCGGGTAG TGGCCACCTA CCCCACGGGG CGCGGCCCGC AGCATGTGGT CCCGTCCTGG
GACCTGCGGA CACTGTGGGT CAACAACAAC GTCGGGAACA GTCTGTCACC GATCGACCCG
CGGACCGGGC GCCTCGCCGG CCGGGCCATA CCGGTCACCG ACCCTTACAA CCTGTACTTC
ACCCTCGACG GAAAGAGCGC GATGGTCATC GCCGAGGCGA ACCACAGCGT CGACTTCCGC
GACCCGCACA CCTTCGCCCT GCGCCACAGC CTCGACGTGG GCAGCCGGTG CGCGGGCGTC
AACCACGTCG ACTTCTCCCC GGACGGCACC TATGCGATCG CCACCTGCGA GTTCGCCGGC
CAACTGGTCA AGATCGACCT GGCGCACGAG CGGGTCCTCG GCTACCTCGA TCTCGGACGC
ACCGCGGCCC CCCAGGACAT CAAGATCGAT CCTGCGGGCC GGGTCTGGTA CGTCGCCGAC
ATGAACGCCG GCGGCGTCCA CCTCATCGAC GGGGACACCC TCACCAAAAC CGGCTTTGTC
CAGACCGGCC CGGAGGCCCA CGGCCTGTAC CCCAGCCGCG ACGGCCGCTA TCTCTACGTC
GTCAACCGGG GCGGCATGAT GACCTCCGAC ATGGTCCCGT TCCCGCATAC CGGTGATCGG
GGCTCCGTGT CGGTCATCTC CTTCACCACC AGGAAGATCG TCGCAAACTG GGCCATCCCC
AGCGGCGGGA CCCCCGATAT GGGCAACGTC AACGCCGACG GGACCCGGCT GTGGCTGTCC
GGCCGGCGCA GCAACGTCGT GTACGTGTTC GATACCGGCG GGAACGGCGG GTCGGACCCG
ACGACCGGCC GGCTGTTGAC CGAGATCCCG GTGGGCCACG AGCCGCACGG GCTCGCTGTC
TGGCCGCAGC CCGGGCGCTA CTCCCTCGGC CACACCGGAA TCATGCGATA A
 
Protein sequence
MPTRAKSHQP RPGKRPRPYG PILLAGLVVL VVALLAGWAL RDPGNQAANA GALPQPPASP 
GASPGASPGA SPAASPARAG ASPTAAVAGG TTVAGATTTA VDVYAHAHAG MLSPVVRDDP
QLVYVPNLSD GTVSVIDQHT LRVVATYPTG RGPQHVVPSW DLRTLWVNNN VGNSLSPIDP
RTGRLAGRAI PVTDPYNLYF TLDGKSAMVI AEANHSVDFR DPHTFALRHS LDVGSRCAGV
NHVDFSPDGT YAIATCEFAG QLVKIDLAHE RVLGYLDLGR TAAPQDIKID PAGRVWYVAD
MNAGGVHLID GDTLTKTGFV QTGPEAHGLY PSRDGRYLYV VNRGGMMTSD MVPFPHTGDR
GSVSVISFTT RKIVANWAIP SGGTPDMGNV NADGTRLWLS GRRSNVVYVF DTGGNGGSDP
TTGRLLTEIP VGHEPHGLAV WPQPGRYSLG HTGIMR