Gene Franean1_7233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7233 
Symbol 
ID5675534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8831178 
End bp8832407 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content75% 
IMG OID641246070 
Productdeoxyribodipyrimidine photo-lyase 
Protein accessionYP_001511458 
Protein GI158318950 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.172483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.496846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGGTGGC TTCGCCGTGA CCTCCGCCTC GACGACAACC CGGCGCTGCT CGCCGCCGCC 
GAGTCCGGCC GGGTGCTGGC TCTCTTCGTC CTCGACGACG CGCTGCGGCG CCCGTCCGGT
CCCGTCCGCC TGGCGTTCCT CCACCGGTGC CTGCGCGACC TGGACGCCCA GCTCGGCGGC
CGGCTGTGCG TGCGCACCGG CTCGCCCTAC GCGGTAGCGC CCGGCCGCCT GCGCAAGGCC
GACGGGACGT CCTACCGGGT GTTCACGCCC TTCTACCGGG CCTGGAAGGA ACACGGCTGG
CGCGGGCCGG CCATCCCGGC GGATCCGGTC TGGCTCCAGC CCGCCGACCT CGACGGCGGC
AGCGAACCGA TCCCGGCGGA TCCGGAGCTC GGGGGCACCG AGCTACCCCC GGCCGGCGAA
CACGCCGCGC ACGAGCGCCT GCGCGCCTTC CTGACCGAAT CGCTGGCCGG CTACGCGGCG
CACCGCGACG AGCCGGCCGC GGCCAGCGAA TCGGGAGACG CCGTACCCGG CTGGTCCGGG
GCCAGCCGGG CTGGCGGGGC CGGTGGCGCT GGCGGGGCTG GCGGGGCCGA TGCGGCTGGC
GGGGGCGGCG GGCTGGCCGG TTCGGCCGAG AAGTTCCGCT CCGAGCTCGC CTGGCGGGAG
TTCTACGCGG ACGTCCTCGC CGGCACCCCC TCGTCGGCCC GGACCGACCT CACCGACACC
CTGGCCGCGT TGGCCTACGA GCCTCCCGGT GACACCTTCG AGGCGTGGAA GTGGGGCCGC
ACCGGTTACC CGATCGTCGA CGCCGGCATG CGCCAGCTCC TCGCCGAGGG CTGGGTGCAC
AACCGGGTCC GGATGATCGA GGCCTCGTTC GTCTGCAAGG ACCTGAACGT CCACCGGACG
CACGGCGCCC GCTGGTACCT CGAGCGCCTC GTCGACGGCG ACCTCGCGTC CAACAACCAC
GGCTGGCAGT GGACGGCCGG CACCGGAACC GACGCCGCCC CGTACTTCCG GGTCTTCAAC
CCCGTCTCGC AGGGGCGCAA GTTCGATCCC GCCGGGGAGT ACATCCGCCG ATGGGTCCCC
GAACTGCGCG GCCTCCCGCC CGACGCGGTG CACGAGCCGT GGAAACTCCC GGCCGGCCCG
CCGAACGGCT ACCCACGCCC AGTCGTCGAT CACGCTGTCG AACGCCGGGA GGCCCTCGAC
CGCCACGCCC GAGCCCGACA CCACGACTAA
 
Protein sequence
MWWLRRDLRL DDNPALLAAA ESGRVLALFV LDDALRRPSG PVRLAFLHRC LRDLDAQLGG 
RLCVRTGSPY AVAPGRLRKA DGTSYRVFTP FYRAWKEHGW RGPAIPADPV WLQPADLDGG
SEPIPADPEL GGTELPPAGE HAAHERLRAF LTESLAGYAA HRDEPAAASE SGDAVPGWSG
ASRAGGAGGA GGAGGADAAG GGGGLAGSAE KFRSELAWRE FYADVLAGTP SSARTDLTDT
LAALAYEPPG DTFEAWKWGR TGYPIVDAGM RQLLAEGWVH NRVRMIEASF VCKDLNVHRT
HGARWYLERL VDGDLASNNH GWQWTAGTGT DAAPYFRVFN PVSQGRKFDP AGEYIRRWVP
ELRGLPPDAV HEPWKLPAGP PNGYPRPVVD HAVERREALD RHARARHHD