Gene Francci3_2065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2065 
Symbol 
ID3904638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2428607 
End bp2429926 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content67% 
IMG OID637879401 
Producttryptophan halogenase 
Protein accessionYP_481167 
Protein GI86740767 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.215345 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGA ATACTGAAGT CGTCGACGTT GTGGTCATCG GCGCCGGGCC GGCAGGTGCG 
GCGGCGGCAG CGACGCTCGC GAAGGCCGGG CGTAGCGTGA TTGTGTTGGA ACGGCGGACC
TTTCCCCGTT TTCACATCGG CGAGTCCATG GTGCCGTTCG TCAACGCCGC GTTCGAGAAG
CTGGGAATAC TCGACCGGCT CAAGCAGCAG GGTTATGTCG CCAAGCATGG TGCCGAGTTC
TGCCAGGGAG AGGAGGACGA GAACGTCCGC GTTTCGTTCA CAACGCAGGG GCCCGGTCGC
CACCACGTGA CGTTCCAGGT AGAGCGGGCG CACCTGGACA ATATGCTCAT TCAGTTCGCG
GGCGAGTGCG GTGCGCGGGT GATCCACGAG GCGACGGTAC ACGACCTGAT AACAGAAGGC
GACCGGGTCG TCGGTGTGCG CTACGAACAC GACGGGACGG CTCGCGAGGT GCGGGCACAG
TACGTTCTCG ACGCCGGCGG GCGGGCCAGC AAGATCGCCA AGGCGTTCCG GCTGCGGAAG
CCGGTCGACC GACTGAAGAT GGTCGCGGTC TTCCGCCACC TCAAGGGCAT CGACGAGGCG
CGCAATCCCG GCTTCGAGGG CGACATCCAG GTGGGCGCCC ACGAGGACGG ATGGCTGTGG
GCCATTCCCA TCTGGCCCGA CACGATGAGC GTCGGCGCGG TCATGCCGCA ACAGGTGCTG
CGGTCCGGCG ATCCCGCCGC GCTGTTCGAC GAGCACGTGT CCCGGGTACG GCGCGTCCGG
GAGCGGGTCG CGGGCGCCCA TCCGGTGAGC GACGTGCAGA TCGAAACCGA CTACTGCTAC
TACTCGGACC AGGTCGCGGG GCCGGGCTGG TTCCTAGCCG GCGACGCGGG CTGCTTCTTC
GACCCGATCT TCTCCGGCGG TGTCTACCTC GCCACCTCCA CCGGCATCCG CGCCGGCGAG
TCCATCGACG CCGCCCTGCG GGAGCCGCTC CGCGCCGAGG AGCTCCAGAA CGAGTACCAG
CGGTTCTACA AGACCGGCTA CGACATGTAC GCCCGGCTGA TCTACATGTA CTACGAGGAG
CCGGATCCTG ACGCCTACCT GGCGTCCGTC GGACTCGACG ACTGCGGCGA CGCCTTCGCG
AGCAACAAGT GGGTCGTCCG CTTCCTCTGC GGGGACTTCT TCAACGCCCG GAACAAGCTC
GCCCAGGAGG TCGTCAAGGA GCGTCGCTGG GACACCTTCG CGCCGTTCGA GCGCGTCAGC
GAGTGCCCGT ACTACGCCGA ACTGAACGAG GCGGAGGACA GGGAGCCCGT CGAGGCGTAA
 
Protein sequence
MAENTEVVDV VVIGAGPAGA AAAATLAKAG RSVIVLERRT FPRFHIGESM VPFVNAAFEK 
LGILDRLKQQ GYVAKHGAEF CQGEEDENVR VSFTTQGPGR HHVTFQVERA HLDNMLIQFA
GECGARVIHE ATVHDLITEG DRVVGVRYEH DGTAREVRAQ YVLDAGGRAS KIAKAFRLRK
PVDRLKMVAV FRHLKGIDEA RNPGFEGDIQ VGAHEDGWLW AIPIWPDTMS VGAVMPQQVL
RSGDPAALFD EHVSRVRRVR ERVAGAHPVS DVQIETDYCY YSDQVAGPGW FLAGDAGCFF
DPIFSGGVYL ATSTGIRAGE SIDAALREPL RAEELQNEYQ RFYKTGYDMY ARLIYMYYEE
PDPDAYLASV GLDDCGDAFA SNKWVVRFLC GDFFNARNKL AQEVVKERRW DTFAPFERVS
ECPYYAELNE AEDREPVEA