Gene Franean1_0733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0733 
Symbol 
ID5669149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp852798 
End bp854588 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content71% 
IMG OID641239660 
Productxanthine/uracil/vitamin C permease 
Protein accessionYP_001505097 
Protein GI158312589 
COG category[R] General function prediction only 
COG ID[COG2252] Permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00949342 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCAAAA TCGGAAGCCC GCCGGCCGAG CGGTCGGTCA ACCTGCCCTA CTGGACGAAG 
GGCGACACCA ACGCGTTCTT CGGCCTGGGC ATCAACGTCC TGGTCAACGT CATCGTCCTG
ACATCGCTGT GCCTGTTCGT GGTGAACATC CCCAAGGGGG ACGTGTTCGG CGCGATCCTG
CCCGCGCTGG GCATCGCGAT GCTGCTCGGC AACTTCTTCT ACGCCTGGCT CGGCCGCCGG
CTGGCCCTCA AGGAGGGCCG TGGCGACGTC ACGGCCATGC CGTACGGGCC GAGCGTTCCG
CACATGTTCA TCGTCGTCTT CGTGATCATG TTGCCGATCT ACCTGCAGAC GAAGGACCCG
GTGGCCGCCT GGCAGGCGGG CCTCGCCTGG GCGTTCATCA TCGGGATCAT CGTCATGATC
GGCGCCTTCG TCGGCCCGAC GATCCGCCGC TACGCCCCGC GCGCGGCCAT GCTCGGCACG
CTCGCCGGCA TCTCCATCGC GTTCATCTCG ATGCGGCCGG CCGCGCAGAT GTGGGACGCC
GCCTGGATCG CGCTGCCGGT CTTCGGCCTG CTGCTCATCG GGCTGCTCAC CGACCTGAAG
CTGCCGTGGA ACCTGCCGAT CGGCGCGGTC GCGCTGCTGC TGGGGACGGC GATCGGCTGG
ATCGGCGGCT TCATGGACGC CCCCGCGGTC GGCGACGCGG CGAAGGACAT CGCTGTCTCG
CTGCCGACGT TCCACTTCGA CAAGCTGATC GACGGCCTGT CCGACATCTC GCCGCTGCTC
GCCACGGCCA TCCCGCTTGG GGTCTACAAC TTCACCGAGG GCATGACCAA CGTGGAGAGT
GCCGCGTCCG CCGGGGACAG CTACAACCTG CGGCCGATCC TGCTCGCCGA CGGCCTCGGC
GCGGTCGTCG GCGCGGCGCT GGGCTCCCCG TTCCCGCCCG CGGTCTACAT CGGCCATCCC
GGCTGGAAGG CGGCCGGCGG CCGGACGGGG TACTCGCTGG CGACCGGCGC CGTCATCGCG
CTGCTGTGCT TCCTGGGGAT GTTCAGCCTG CTCAACGCGG TGCTCCCGCT GCCGGCGATC
GTGCCGATCC TGCTCTACAT CGGGCTGCTG ATCGGTGCGC AGGCCTTCCA GGTGTCACCG
AAGGCGCACG GCGCCGCGGT GGTGGCGGCG ATCATCCCGA ACATCGCGTC CTGGGCGGCG
GGGCTCATCG ACAACACGGT GACCACCGCG GTCGGCGTGG CGTCCAACCT CAACCCGTCG
GTCCAGCTCA CCGTCACCGA CGACGATCTC GAGGCGAACA GCGTGCTGCT GCACGGGCTG
CACGTCCTCG GCGACGGGGC CGTCCTCGCC GGTCTGGTCC TGGGCACGAT CGTGGCGTTC
ATCATCGACA AGCGGTTCGT CCACGCGACG ATCGCGTCCG CGGCCGGCGC GGTGCTGGCG
TTCGTCGGCC TGATCCACGG CGAGAAGGTG GAGTGGAACG CCAGCGGCCA GGTCGCGCTG
GGCTATCTGT TCCTCGCGGT GGTCTGCGCG ATCTGGGCCC TGACGAAGCC CGCGCCGCGG
GTGCCCGACG CCGAGGAGAT CGAGCTGGAA CGGGTGCACG GCGTGCCCCC GCAGCGCTCC
CGCAGCGACG CCGCGCCGGC AGCCGTGCCG GAACCCGTGC CGACGGCGGT GCCGGCGGCC
GTGAACGGCG GGCGACCGGG CGCGGACGAG CCGTCCTCGG CTGAGCCCGC CACGGCTCAG
CCCGCCACGG CTCAGCCCGC GGCCGGGAAG CCGGCAGCAG CGACGTCCTG A
 
Protein sequence
MIKIGSPPAE RSVNLPYWTK GDTNAFFGLG INVLVNVIVL TSLCLFVVNI PKGDVFGAIL 
PALGIAMLLG NFFYAWLGRR LALKEGRGDV TAMPYGPSVP HMFIVVFVIM LPIYLQTKDP
VAAWQAGLAW AFIIGIIVMI GAFVGPTIRR YAPRAAMLGT LAGISIAFIS MRPAAQMWDA
AWIALPVFGL LLIGLLTDLK LPWNLPIGAV ALLLGTAIGW IGGFMDAPAV GDAAKDIAVS
LPTFHFDKLI DGLSDISPLL ATAIPLGVYN FTEGMTNVES AASAGDSYNL RPILLADGLG
AVVGAALGSP FPPAVYIGHP GWKAAGGRTG YSLATGAVIA LLCFLGMFSL LNAVLPLPAI
VPILLYIGLL IGAQAFQVSP KAHGAAVVAA IIPNIASWAA GLIDNTVTTA VGVASNLNPS
VQLTVTDDDL EANSVLLHGL HVLGDGAVLA GLVLGTIVAF IIDKRFVHAT IASAAGAVLA
FVGLIHGEKV EWNASGQVAL GYLFLAVVCA IWALTKPAPR VPDAEEIELE RVHGVPPQRS
RSDAAPAAVP EPVPTAVPAA VNGGRPGADE PSSAEPATAQ PATAQPAAGK PAAATS