Gene Franean1_5934 details


Gene Information

Locus tag: Franean1_5934
Symbol: deoA
ID: 5674255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7207246
End bp: 7208580
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 75%
IMG OID: 641244782
Product: thymidine phosphorylase
Protein accession: YP_001510184
Protein GI: 158317676
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02644] pyrimidine-nucleoside phosphorylase
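
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions rather than statements from the record, although the listed values are consistent with them. The function names are illustrative only.

```python
# Values copied from the record; the function names are hypothetical helpers.
START_BP = 7207246
END_BP = 7208580


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1335
print(protein_length(length_bp))  # 444
```

Both results agree with the Gene Length and Protein Length fields listed for this gene.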


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0130951
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCTG ATGCCCCCGG CAGCGATCAC GGCGGTCCTG CGGCGTCCCA CGACGTCGTC 
GACCTCATCC GGGCCAAGCG CGACGGGGCG GCGCTGCCGG CGGACGCGGT CGCCTGGCTC
ATCGACGCCT ACACCCACGG CCGGGTCGCC GACGAGCAGA TGTCCGCCTA CCTGATGGCC
GTGGTCTGGC GCGGCATGGC CTCCGACGAA CTGGATCACT GGACGTCGGC GATGATCGCC
AGCGGCGAAC GGCTGGACCT GTCCGGCCTG ACCCGCCCGA CGGTCGACAA GCATTCCACC
GGCGGGGTCG GGGACAAGGT CTCGCTGGTC CTCGCCCCGC TGGTGGCCGC GTGCGGAGCG
GCCGTCCCGC AGCTGTCCGG ACGTGGGCTC GGGCACACCG GCGGGACGCT TGACAAAATG
GAGGCGATCC CCGGCTGGCG CGCCGATCTC GACCCGGGCA CCATGCGTGC CGTGCTGGCG
GACGTCGGCG CCGTCATCTG CGCCGCCGGC CCCGGCCTGG CTCCCGCGGA CCGCAGGCTG
TACGCCCTGC GCGACGTCAC CGGCACGGTC GAGTCCATCC CGCTGATCGC CTCCTCGATC
ATGAGCAAGA AGATCGCGGA GGGGACGTCC GCGCTGGTCC TGGACGTCAA GGTCGGCGCC
GGCGCCTTCA TGACCTCACT CGCCGACGCC CGCGAGCTCG CGCGGACGAT GGTCGGCCTG
GGCGCCCGCG CCGGGGTGCG CACCGAGGCC CTGCTCACCG CGATGGACAC CCCGCTGGGC
CGCACCGCGG GCAACGGCCC CGAGGTGACC GAGGCGGTCG AGACCCTGCG CGGGGCGGGC
CCGTCGGACC TCGTCGAGGT GACCGTCGCC CTCGCCCGCG TCATGCTGGA CATCGTCGGC
CTCAGCGGTG GTTCCGGTGC CGCGCCGGAT CCGGCGGAGG TCCTGGCCTC CGGGGCGGCA
TACGACGTGT GGCGCGCGAT GGTCGCCGCC CAGGGCGGCG ATCCGGACGC CCCGCTGCCG
ACCGCCGCGT TCACCCGCAC TGTCTCCGCT CCGGCGGACG GCTACCTGAG CCGCCTCGAC
GCCCGCGCCC TGGGCATCGC CGCCTGGCGC CTGGGCGCGG GGCGCGCGCG GAAGGAAGAT
CCCGTCTCGC CGGCAGCTGG ACTGCGTTGG CTGGCGGCGG TGGGGGAGCA GGTCCAGGCC
GGCGCCCCCC TGATCGAGCT CTACTCCGAC GACGAGGCGA CCTTCCCCCG CGCGCTCTCC
GCACTCGCGG ACGCCGTCTC GGTCACCGAC GAGCCGCCCC CTCCCACCCC CCTGATCCTC
GACCACATCC GCTGA
 
Protein sequence
MSADAPGSDH GGPAASHDVV DLIRAKRDGA ALPADAVAWL IDAYTHGRVA DEQMSAYLMA 
VVWRGMASDE LDHWTSAMIA SGERLDLSGL TRPTVDKHST GGVGDKVSLV LAPLVAACGA
AVPQLSGRGL GHTGGTLDKM EAIPGWRADL DPGTMRAVLA DVGAVICAAG PGLAPADRRL
YALRDVTGTV ESIPLIASSI MSKKIAEGTS ALVLDVKVGA GAFMTSLADA RELARTMVGL
GARAGVRTEA LLTAMDTPLG RTAGNGPEVT EAVETLRGAG PSDLVEVTVA LARVMLDIVG
LSGGSGAAPD PAEVLASGAA YDVWRAMVAA QGGDPDAPLP TAAFTRTVSA PADGYLSRLD
ARALGIAAWR LGAGRARKED PVSPAAGLRW LAAVGEQVQA GAPLIELYSD DEATFPRALS
ALADAVSVTD EPPPPTPLIL DHIR
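
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the protein sequence. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        # Codons with characters other than A, C, G, T become "X".
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the listing.
print(translate("ATGAGCGCTG ATGCCCCCGG CAGCGATCAC"))  # MSADAPGSDH
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check, and `gc_percent` on the full sequence can be compared with the GC content field.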