Gene Franean1_1963 details


Gene Information

Locus tag: Franean1_1963
Symbol:
ID: 5670364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2359471
End bp: 2360640
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 73%
IMG OID: 641240884
Product: radical SAM domain-containing protein
Protein accession: YP_001506306
Protein GI: 158313798
COG category: [L] Replication, recombination and repair
COG ID: [COG1533] DNA repair photolyase
TIGRFAM ID:
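
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Both are assumptions about how this record is reported, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2359471, 2360640)
print(length_bp)                           # 1170
print(expected_protein_length(length_bp))  # 389
```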


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.003672
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAACG TGTGTTCTAA TGGCGGGGTG CGGTGGGAGA ACCTGAGCCT CGTCGAGTCC 
GCGGACACCC CGGTGCGCGG GCTGCTCGAC CGCGCCGCGG TGACCCGGAC GTTCGACACC
CCGGGCTTCG CCGGAATGAC CTTCTACGAG ATCCACGCGC GGTCCGCGCT CAATCGCGTC
CCGGCGGTAT CGCGCGTCCC GTTCCGGTGG ACGCTGAACC CCTACCGGGG CTGCTCACAC
GCCTGCCGAT ACTGCTTCGC CCGCAACACC CACTCCTACC TCGACCTCGA CACCGGTCTC
GACTTCGATT CCAGGATCGT CGTCAAGGTC AACGTGGCCG AGCGGCTACG GGCCGAGCTG
GCCGCGCCGA AGTGGCGCGG CGAGTCCGTG GCGATGGGTG CGAACGTCGA CCCTTATCAG
CGGGTGGAAG GGCGCTACCA GCTCATGCGG GGCGTGCTCG GCGTCCTGCG CGACGCGGCG
AACCCGTTCT CGATCCTCAC CAAGGGCACC CTGATCCTGC GTGACCTCGA CCTGCTGGCC
GAGGCCGCCG CCGTCACCGA GGTCCGCGTG GCGGTCTCGG TCGGTTTCGT CGACGACGAC
CTGTGGCGCA CGGTCGAGCC GGGAGCCCCC CGCCCGGAAC GCCGGCTGGA GGTCTGTGCG
GCGCTCGGCG CCGCCGGCAT CGAGTGCGGG GTGCTGATGG CGCCCGTACT CCCCGGCCTG
AGTGATTCGC CGGCGGCGCT GGAGCGCGCG GTGCGCCGCA TCGCCGAGGC GGGTGCGGCC
AACGTGACCC CGATCGTGCT GCACCTGCGG CCCGGGGCGC GGGAGTGGTA CCTCGGCTGG
CTCGGGGAGC ATCACTCCGA CCTCCTCCCT CTGTACCGAT CCCTCTACGG CGGGGGCTCC
TACGCGCCGC GGGCCTACAG CGAGCGGATC TCCGCCTTGG TCCGGGACCT GGCCCGCCGG
TACGGCATCG CCGGTGCCGC CGCGCCGGCC GCCTCACCGG CCTCCTCTCG GTGGGCGCCG
GCGGAGGCGC GTGGGGTGAG TGGGGTGCGG GGTGGGCGTG CGGCGGTCAT CCACCGTGGT
CCGACACGTG CCGTTACGGC GGTGTCCGCG GCGGTGGCCG GCCGGGCGCC GCTGGAGGAG
CAGCTCCTCC TGCCCGGCTT CGGTTCCTGA
 
Protein sequence
MSNVCSNGGV RWENLSLVES ADTPVRGLLD RAAVTRTFDT PGFAGMTFYE IHARSALNRV 
PAVSRVPFRW TLNPYRGCSH ACRYCFARNT HSYLDLDTGL DFDSRIVVKV NVAERLRAEL
AAPKWRGESV AMGANVDPYQ RVEGRYQLMR GVLGVLRDAA NPFSILTKGT LILRDLDLLA
EAAAVTEVRV AVSVGFVDDD LWRTVEPGAP RPERRLEVCA ALGAAGIECG VLMAPVLPGL
SDSPAALERA VRRIAEAGAA NVTPIVLHLR PGAREWYLGW LGEHHSDLLP LYRSLYGGGS
YAPRAYSERI SALVRDLARR YGIAGAAAPA ASPASSRWAP AEARGVSGVR GGRAAVIHRG
PTRAVTAVSA AVAGRAPLEE QLLLPGFGS
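
Both sequences are laid out in space-separated blocks of ten letters, so the whitespace has to be removed before they can be measured or compared. The sketch below is one hypothetical way to do that and to run two sanity checks: the GC percentage of the gene sequence (for comparison with the reported GC content), and whether the gene length matches the protein length plus one stop codon. The helper names are illustrative, and the stop-codon set assumes translation table 11 uses TAA, TAG and TGA.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons assumed for translation table 11


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for the block layout."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def cds_matches_protein(dna: str, protein: str) -> bool:
    """True if the CDS is 3 * (protein length + 1) bp long and ends in a stop codon."""
    dna, protein = clean(dna), clean(protein)
    return len(dna) == 3 * (len(protein) + 1) and dna[-3:] in STOP_CODONS


# Toy input; paste the full gene and protein sequences to check this record.
print(gc_percent("AT GC"))                         # 50.0
print(cds_matches_protein("ATGTCG AACTGA", "MSN"))  # True
```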