Gene Francci3_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1098 
Symbol 
ID3905769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1309956 
End bp1311128 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content67% 
IMG OID637878431 
Productpurine phosphorylases family protein 1 
Protein accessionYP_480208 
Protein GI86739808 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0775] Nucleoside phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAAA CACACCCGAT CGTCGTTTTG ACCGCCCTTG ACCTGGAGTA TCAGGCGGTT 
CGCGAACATC TGGCGGATGC ACGACTGCAC CGCCATCCGC AGGGAACCCG CTTCGAGGTA
GGCCGGCTGG CCCGCGGCCG GTGCCGGGTG GCGCTTGCAC ATGTCGGCGT GGGAAACCAG
TCGGCCGCGG TACTCGCTGA ACGAGCCATA GCCGAGTTCA CGCCGGCGGC GCTGCTCTTT
GTCGGCGTCG CAGGCGCGCT TCATCGCCAT ATCGCACTGG GCGACGTCGT GGTGGCCACC
CATGTGTACG CCTTCCACGG CGCCACCAAC GATGACGAAG GGCTCTGGGG GCGACCACGC
ACCTGGCCGC TGTCGCACCG GGCCGACCAG ATCGCCCGCC ACCTCTATCG GACGAGGTCG
TGGGCACGAC CGTCGGTCGA AGCCGAGTCT CTGCCACAGG TGCACTTCGG GCCGATCGCG
GCAGGGGAGG TGGTGCTGAA CTCCACCGTG TCCGCTCTGG CCCGCTTGCT GCACGAACGT
TACAACGACG CGCTCGCCAT CGAGATGGAG GGCGCGGGAG CCAGCCAGGC CGGGTTGCTG
AACGACTCGC TGCCAGTGGT TGTAGTCCGC GGCATCAGTG ACCACGCCGA TGGCACCAAG
GAATTGACCG ACCGCCAGCT GTGGCAGCAG CGCGCTGTGG CAAACGCCGC TGCGTTCGCC
GCAGCGCTGG CCGAGGAACT GTCAACGGAC ATCGGACGGG TCGATGCCGC GGAACCCAGG
ATCGGGAGGA CACCCATCAT GCAGACACCG CACCAGAACA TCCGCATCAT CGCTTCGGAA
GGCGCGCAGG TCGGTGCGCA GACCGGAGTC GTGCACGGTG ACGTGCATAT CGGCGTCGCC
GGTGAGCGCG CTCGGGTCGA CCTGCCGACA GCACTCCTCC GCTTCCGCGC TCGTCTGGAC
GATGCACGCA CGGCCGGCGA TGTCGATGCT GAGACTTATG CCGCCGCCGA GGCTGAGCTC
CGCGAGGCCG ACAAAGCACT CCAGGCGGAT TCACCCGCTA CTCGCGGCGC CCTGCTGATG
GCGCTGAAGA AGGTCCGCGG ATTGGTCGGC GACGTCGCCG ACCTCGCCGC GAAGATCGGC
ATGGTGATCA TGCTCGCTCA AGGCGTGTCG TGA
 
Protein sequence
MGKTHPIVVL TALDLEYQAV REHLADARLH RHPQGTRFEV GRLARGRCRV ALAHVGVGNQ 
SAAVLAERAI AEFTPAALLF VGVAGALHRH IALGDVVVAT HVYAFHGATN DDEGLWGRPR
TWPLSHRADQ IARHLYRTRS WARPSVEAES LPQVHFGPIA AGEVVLNSTV SALARLLHER
YNDALAIEME GAGASQAGLL NDSLPVVVVR GISDHADGTK ELTDRQLWQQ RAVANAAAFA
AALAEELSTD IGRVDAAEPR IGRTPIMQTP HQNIRIIASE GAQVGAQTGV VHGDVHIGVA
GERARVDLPT ALLRFRARLD DARTAGDVDA ETYAAAEAEL READKALQAD SPATRGALLM
ALKKVRGLVG DVADLAAKIG MVIMLAQGVS