Gene Francci3_2465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2465 
Symbol 
ID3905077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2905915 
End bp2907402 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content66% 
IMG OID637879795 
Producttryptophan halogenase 
Protein accessionYP_481561 
Protein GI86741161 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.842716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.17253 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGATT CTGCGGAATT TGATGTGGTG GTCGTTGGCG GCGGACCCGC CGGCTCCACA 
CTGGCCGCGC TGGTGGCCAT GCAGGGGCAT CGAGTCCTTG TCCTGGAGAA GGAGCACTTT
CCGCGCTACC AGATCGGCGA GTCGCTGCTA CCATCCACTA TCCACGGGGT CTGCCGGCTG
ACCGGCGCCG CCGACGAACT GGCCAAAGCC GGCTTCCCGC TCAAGCGCGG CGGTACCTTC
AGATGGGGGG CCACCCCGGA GCCGTGGACG TTCGCCTTCT CGGTGTCGTC GCGGATGGCT
GGGCCGACCT CATTCGCCTA TCAGGTTGAA CGGTCGAAAT TCGACGAGAT TCTACTGCGG
AACGCCCGCC GGGTCGGCGC CGAGGTACAC GAGGGCTGCT CGGCCACCGA CGTCATCGAG
GACGGCGACC GGGTCGTCGG CATCCGCTAC ACCGACGACG GCGGCAACCG GCGTGAGGCG
CGGGCCTCCT TCGTGGTCGA CGCCACCGGC AACAAAAGCC GCATCTACCA TCGGGTTGGT
GGCACCCGGC AGTACTCGGA GTTCTTTCGC AGCCTGGCCC TGTTCGGCTA CTTCGAGGGC
GGCCGGCGGA TGCCCGAGCC CAACCGGAAC AACATCCTGT GTGTGGCCTT CGACAGCGGC
TGGTTCTGGT ACATCCCACT GAGCGACACG CTGACCAGCG TCGGCGCGGT CGTACGGTCG
GAAATGGCGG AGAAGGTCCA GGGTGACTCC GAGCAGGCCA TGAAGGCGCT CATTGAGGAG
TGCCCGATGA TTTCGGATTA CCTCGCGCCG GCCAGGCGGG TCACCACCGG GCAGTACGGC
CAGCTCCGGG TACGCAAGGA CTACTCCTAT CATCAGACGA CTTTCTGGCG TCCCGGGATG
GTTTTGGTCG GCGACGCCGC GTGCTTTGTG GACCCGGTGT TCTCCTCCGG CGTGCACCTC
GCGACCTACA GCGCGCTGCT CGCGGCCCGG TCCATCAACA GCGTCCTCGC CGAGATTGTG
GACGAGAAGA CCGCGATGCA GGAGTTCGAG GCCCGCTACC GCCGAGATTA CGGCGTGTTC
TACGAGTTTC TGGTGTCGTT CTACGAGATG CATCACAGCG AGGACTCCTA CTTCTGGCAG
GCCAAGAAGG TCACCGGGAA CAGCCAGCCC GAGCTGGAGG CATTCGTCGA GCTGATCGGC
GGAGTGTCGT CGGGGGAATC CGCGCTGACC GACGCCGACG CCCTGGCCCT CCGGCTGCAG
GCCAACACCG CCGACTTCAC TACCGCGGTC GACGCGCTCG TGGCCAACAA CAGCGAGAGC
ATGGTGCCGT TCATGAAGTC GCAGGTGATC CGCGGGGTCA TGCACGAGGG CTCGCAGATG
CAGATGCGCG CGCTGCTCGG TGAGGACGCC GAGCCGGAGA CCCCGCTGTT CCCCGGCGGT
CTGGTCTCGT CGGCTGACGG CATGTTCTGG CTGCCCACCG ACGCTTAG
 
Protein sequence
MTDSAEFDVV VVGGGPAGST LAALVAMQGH RVLVLEKEHF PRYQIGESLL PSTIHGVCRL 
TGAADELAKA GFPLKRGGTF RWGATPEPWT FAFSVSSRMA GPTSFAYQVE RSKFDEILLR
NARRVGAEVH EGCSATDVIE DGDRVVGIRY TDDGGNRREA RASFVVDATG NKSRIYHRVG
GTRQYSEFFR SLALFGYFEG GRRMPEPNRN NILCVAFDSG WFWYIPLSDT LTSVGAVVRS
EMAEKVQGDS EQAMKALIEE CPMISDYLAP ARRVTTGQYG QLRVRKDYSY HQTTFWRPGM
VLVGDAACFV DPVFSSGVHL ATYSALLAAR SINSVLAEIV DEKTAMQEFE ARYRRDYGVF
YEFLVSFYEM HHSEDSYFWQ AKKVTGNSQP ELEAFVELIG GVSSGESALT DADALALRLQ
ANTADFTTAV DALVANNSES MVPFMKSQVI RGVMHEGSQM QMRALLGEDA EPETPLFPGG
LVSSADGMFW LPTDA