Gene Francci3_0685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0685 
SymboldeoA 
ID3905273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp783059 
End bp784363 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content73% 
IMG OID637878018 
Productthymidine phosphorylase 
Protein accessionYP_479798 
Protein GI86739398 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0366488 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGAA GCTTCGACGT CGTCGATCTG ATCCGGGCCA AGCGGGACGG TCGACCCGTC 
GATCCCCGGG CCGTCGACTG GCTCGTCGAC GCCTACACCC GGGGTCTGGT TGCCGACGAG
CAGATGTCGG CGTACCTGAT GGCGGTGGTG TGGCGCGGGA TGACTCCCGC CGAGCTGGAC
CGCTGGACCG CCGCAATGAT CGACAGTGGG GAGCGCCTGG ATCTGACCGG TGTGGGACGT
CCCACCGTCG ACAAACACTC CACCGGAGGG GTCGGCGACA AGGTCTCGCT CGTGCTGGTG
CCGCTGGTCG CCGCCTGTGG GGCCGCGGTC CCACAGCTCG CCGGGCGGGG GCTCGGTCAC
ACCGGTGGAA CGCTGGACAA GATGGAGGCG ATCCCCGGCT GGCGGGCGGA TCTGTCGGCC
GCCCGGATGC GTGAGCTCCT CGCCGAGGTG GGTGCGGTGA TCGCCGCCGC GGGTGCCGGC
CTCGCCCCGG CCGACCGCCG GCTCTACGCG CTGCGGGACG TCACCGGAAC CGTCGAGTCG
ATCCCGCTCA TCGCATCGTC GATCATGAGC AAGAAGATCG CCGAGGGCAC GTCGGCGCTG
GTCCTGGACG TCAAGGTCGG CTCCGGGGCC TTCATGACGT CGGTGGATGA GGCCCGTGAG
CTCGCCCGGA CGATGGTTCG GATCGGCGTT GCCGCCGGGG TCCGCACCGA GGCCCTGCTG
ACCGGGATGG ACCATCCCCT CGGCCGGACC GCCGGGCATG CGCTGGAGGT GGCCGAGGCC
GTGGAGACCC TCCGTGGTGG TGGGCCGGCG GATCTGGTGG AGGTCACCGT CGCGCTGGCC
AGGGTGATGA TCGACCTCGT CGCCGCCGAA CTCGGCCACC GGTCCGGTGC TCTTCATGAT
CCCGCGCAGG TACTGGCTGC CGGGGACGCT TTTGCGGTGT GGCGGGCGAT GGTCGCGGCC
CAGGGCGGCG ATCCGGACGC GCCGCTTCCG GCGGCGAGCC ATGTCGAGAC CGTCCCCGCG
CCGGCGACCG GCCATCTCCA CCGCCTGGAC GCCCGAGCGG TCGGCCTGGC GGCCTGGCGG
CTTGGCGCGG GCCGGGTCCG CAAGGAGGAC GCGGTCTCCG CGACGGCGGG CGTGCGGTGG
CGGGTGGGAA TCGGTGACCC GGTCACGGCC GGCGAGCCGT TGCTGGAACT GCACACCGAT
GACCCGGCCA GCGTCGAGCG GGCCCGGGAG GCCCTGGCGG GAGCGGTGGA GGTCGCCGCG
ACGCCGCCCC CGAGCACACC GCTTGTCCTC GATCACATCA GCTGA
 
Protein sequence
MSGSFDVVDL IRAKRDGRPV DPRAVDWLVD AYTRGLVADE QMSAYLMAVV WRGMTPAELD 
RWTAAMIDSG ERLDLTGVGR PTVDKHSTGG VGDKVSLVLV PLVAACGAAV PQLAGRGLGH
TGGTLDKMEA IPGWRADLSA ARMRELLAEV GAVIAAAGAG LAPADRRLYA LRDVTGTVES
IPLIASSIMS KKIAEGTSAL VLDVKVGSGA FMTSVDEARE LARTMVRIGV AAGVRTEALL
TGMDHPLGRT AGHALEVAEA VETLRGGGPA DLVEVTVALA RVMIDLVAAE LGHRSGALHD
PAQVLAAGDA FAVWRAMVAA QGGDPDAPLP AASHVETVPA PATGHLHRLD ARAVGLAAWR
LGAGRVRKED AVSATAGVRW RVGIGDPVTA GEPLLELHTD DPASVERARE ALAGAVEVAA
TPPPSTPLVL DHIS