Gene Francci3_3416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3416 
Symbol 
ID3905656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4060598 
End bp4061644 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content69% 
IMG OID637880739 
Productaldo/keto reductase 
Protein accessionYP_482499 
Protein GI86742099 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTACC GCAAACTCGC CCGGACCGGC GTCGAGGTCA GCACCCAGTG TCTCGGCGCC 
ATGATGTTCG GCGCGTTCGG CAACACCGAC CACGACGAAT GCGAACGCAT CATCCATCGG
GCCCTCGACG CCGGCATCAA CTTCCTCGAC ACCGCCGACA TCTACTCTCA GGGTGAGAGC
GAGGAGATCG TCGGCCGGGC CATCAAGACC CGCCGCGACG ACGTTGTCCT TGCCACGAAG
TGCTTCAGCC CCATGGGCGA CGAGCGCAAC TCCCGCGGCG GCTCGCGGCG CTGGATCATC
CGCGCCGTCG AGGATAGCCT GCGCCGCCTC GGCACCGACT ACATCGACCT CTACCAGGTC
CACCGCCACG ACTGGGACAC CGACCTCGAG GACACCCTCG GCGCCCTGTC CGACCTCGTC
CACGCCGGCA AGGTCCGCTA CCTCGGCTCC TCCTCCTTCC CCGCCGACTG GATCGTCGAA
GCCCAGTGGG CCGCCCGGCG CCGTAACAGC GAACGGTTCG TCTGCGAACA GCCCCAGTAC
TCCATCTTCG CCCGCTCCAT CGAAGAAGCC GTCCTGCCCG CCGCCCAACG CCACACCATC
GGCATCATCC CCTGGAGCCC GCTGGCCGGC GGCTGGCTCA CCGGCAAGTA CCGGCGCGGC
GAGCAGGCTC CGGCCGGATC GCGCTACTCG TCACAGGGCT TGCTCGGCCG CCGCCAGGGT
CAGGCACTCT CCGAGGACCC GCACGCACCC GCCCGCTTCA CCGTGGTGGA GGAGCTGACC
ACCCTCGCCA AGCAGGCCGG CGTGTCCCTC ACCCACCTCG CGCTGGCCTT CGTCGACAGC
CACCCCGCCG TCACCTCAAC CATCATCGGG CCCAAGACGC TCACTCAGCT CGACGACGTC
CTCCTGGCGG CCGAGATCAC CCTCGACCAG GCGACACTCG ACGCGATCGA CGAGATCGTC
ACCCCCGGCA CCGACATCGC CGGCACCAAC CACCACAGCG GTAACCCGGC ACTGCGGCCG
AGTTGCCGGC GTCGGACGGC CCACTGA
 
Protein sequence
MKYRKLARTG VEVSTQCLGA MMFGAFGNTD HDECERIIHR ALDAGINFLD TADIYSQGES 
EEIVGRAIKT RRDDVVLATK CFSPMGDERN SRGGSRRWII RAVEDSLRRL GTDYIDLYQV
HRHDWDTDLE DTLGALSDLV HAGKVRYLGS SSFPADWIVE AQWAARRRNS ERFVCEQPQY
SIFARSIEEA VLPAAQRHTI GIIPWSPLAG GWLTGKYRRG EQAPAGSRYS SQGLLGRRQG
QALSEDPHAP ARFTVVEELT TLAKQAGVSL THLALAFVDS HPAVTSTIIG PKTLTQLDDV
LLAAEITLDQ ATLDAIDEIV TPGTDIAGTN HHSGNPALRP SCRRRTAH