Gene Francci3_0383 details


Gene Information

Locus tag: Francci3_0383
Symbol:
ID: 3903435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 456092
End bp: 458050
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 70%
IMG OID: 637877712
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_479499
Protein GI: 86739099
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
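
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 456092
end_bp = 458050

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codon_count - 1

print(gene_length, "bp")      # 1959 bp
print(protein_length, "aa")   # 652 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.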


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGTCGC GTACCGACCG TTCGTCCTCG TCCACCTCGA AGGCCGTCAC CTCGTCCCCG 
TCCACCTCAT CCTTGTCGTC CGCCGCATCC TCCCCGTCCG TCTCGTCCTC GTCCTCCTCG
TCGTCCGTCT CGGCCGCGGG GATGACGGCG GTCTCGACGG GTCCGCTGAC GGGCAGCCGC
AAGACCTGGC TGGTCGGAGC GGATCCCGAT CTGCGGGTGC CGATGCGGGA GATCGTGCTG
ACCACGGGTG ACACCGTCGT GGTGTACGAC ACCTCCGGTC CATATACCGA TCCGGGGGTG
ACGATCGATG TGCGCCGGGG TCTGCCGGCG ACGCGGGACA GCTGGATCGC CCAGCGTGGC
GACACCGCGC CGGATGAGCG GCGGACCGTT CCCGGGACCG GGGCATCGGG TCCTGGGACG
CTCGGTTCTG GGACGCCCGG TTCTGGGACG CCCGGTTCTG GGCCGCTCGG TCTCGGCGGG
ACGGATCTCG ACGGGCGGGT GCGAGTGCCC CGTCGGGCGG TGCCGGGACG GCCTTCCATA
ACCCAGCTGG GCTACGCCCG CCGCGGTCAG ATCACTCGGG AGATGGAGTT CGTCGCCCTG
CGGGAGGGTC TTCCCGTCGA GACGGTGCGG GCGGAGATCG CCGCGGGGCG GGCCGTGCTG
CCGGCGAACG TGAACCATCC CGAGTCCGAG CCGATGGCGA TCGGCCGTGC GTTTCTCGTG
AAGATCAATG CGAACCTCGG CAACTCGGCC GTTACCTCCT CGATCGAGGA GGAGGTCGAG
AAGATGGTGT GGGCGACCCG CTGGGGCGCG GACACCGTGA TGGACCTCTC GACCGGGTCG
GACATCGCCC TGACCCGTGA GTGGATCATC CGTAACGCGC CGGTGCCGGT CGGGACCGTG
CCGATCTACC AGGCGTTGGA GAAGGTCGGT GGCCGGCCGG AGAAGCTGTC CTGGGAGGTC
TACCGGGACA CCGTGATCGA GCAGTGCGAG CAGGGTGTGG ACTACATGAC GGTCCACGCG
GGGGTGCTGC TGCGCTACGT GCCGCTGACC GCGCGGCGCA GGACCGGGAT CGTCTCGCGC
GGCGGCTCGA TCCTGGCCTC CTGGTGCCTG GCCCATCACG AGGAGAACTT CCTCTACACC
CACTTCGCCG AGCTGTGCGA GATCTTCGCC GCGTACGACG TCACGTTCTC TCTGGGCGAC
GGCCTGCGGC CCGGGTCCAT CGCGGACGCG AACGATGAGG CCCAGCTCGC CGAGCTCGCC
ACCCTGGGCG AGTTGACGCA GGTGGCGTGG GAGCACGACG TCCAGGTGAT GATCGAGGGA
CCCGGGCACG TGCCCATGAA CAAGATCGAG GAGAACGTGC AGCTGCAGCG GGAGCTGTGC
CACGACGCGC CGTTCTACAC CCTCGGGCCG CTGACCACCG ACATCGCCCC CGGCTACGAC
CACATCACCT CCGCGATCGG GGCGGCGATG ATCGGATGGG CCGGTACCGC CATGCTCTGT
TACGTCACCC CGAAGGAGCA TCTCGGCCTG CCCGACCGGG ACGACGTCAA GGCCGGCGTT
ATCGCCTACA AGATCGCCGC GCACGCCGCC GACCTCGCTA AGGGGCATCC CGGCGCGCAG
GCCTGGGATG ACGCCCTGTC GGACGCCCGG TTCGAATTCC GCTGGGCCGA CCAGTTCCAC
CTCGCGCTCG ACCCCGACAC CGCACGCGCG TTCCACGACG AGACGCTGCC GGCCCCGGCC
GCGAAGTCGG CGCACTTCTG TTCGATGTGC GGCCCGCACT TCTGCTCGAT GAAGATTTCC
CACCAGGTGC GGGCACACGC GGGCGGGGAC GGGCTGGACC CGGCCGGGCA TGGCGCGGAC
CCGGCCGGCG ACGAGGCGGT CACCGCCGGC CTGCGCGAGA AGGCCGCCGA GTTCAACGCC
GCCGGCAATC GGATCTACCT TCCGGTGGCC AACTCCTGA
 
Protein sequence
MVSRTDRSSS STSKAVTSSP STSSLSSAAS SPSVSSSSSS SSVSAAGMTA VSTGPLTGSR 
KTWLVGADPD LRVPMREIVL TTGDTVVVYD TSGPYTDPGV TIDVRRGLPA TRDSWIAQRG
DTAPDERRTV PGTGASGPGT LGSGTPGSGT PGSGPLGLGG TDLDGRVRVP RRAVPGRPSI
TQLGYARRGQ ITREMEFVAL REGLPVETVR AEIAAGRAVL PANVNHPESE PMAIGRAFLV
KINANLGNSA VTSSIEEEVE KMVWATRWGA DTVMDLSTGS DIALTREWII RNAPVPVGTV
PIYQALEKVG GRPEKLSWEV YRDTVIEQCE QGVDYMTVHA GVLLRYVPLT ARRRTGIVSR
GGSILASWCL AHHEENFLYT HFAELCEIFA AYDVTFSLGD GLRPGSIADA NDEAQLAELA
TLGELTQVAW EHDVQVMIEG PGHVPMNKIE ENVQLQRELC HDAPFYTLGP LTTDIAPGYD
HITSAIGAAM IGWAGTAMLC YVTPKEHLGL PDRDDVKAGV IAYKIAAHAA DLAKGHPGAQ
AWDDALSDAR FEFRWADQFH LALDPDTARA FHDETLPAPA AKSAHFCSMC GPHFCSMKIS
HQVRAHAGGD GLDPAGHGAD PAGDEAVTAG LREKAAEFNA AGNRIYLPVA NS
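
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11, which match the standard genetic code, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of ten bases,
    # so whitespace is removed before reading codons.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases and last 9 bases of the gene sequence, copied from this record.
print(translate("ATGGTGTCGC GTACCGACCG T"))  # MVSRTDR
print(translate("AACTCCTGA"))                # NS
```

The two outputs correspond to the first seven and the last two residues of the protein sequence listed above.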