Gene Francci3_2756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2756 
Symbol 
ID3906467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3246852 
End bp3248708 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content68% 
IMG OID637880079 
Producttryptophan halogenase 
Protein accessionYP_481845 
Protein GI86741445 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGATC ATTACGACGT TATTATCGCC GGCGGCGGGC CGGCGGGCTC TACACTGGCC 
GCGCTGCTCG CCCGCACGTC GGACCTGAAA GTAGCGATCT TCGAGAAGGA TGAGTTCCCG
CGCGAGCACA TCGGCGAGTC GTTCGCGCAC CCGTTGATCC CGGTGCTGGC AGAGAGCGGA
GCGCTCGCGA AGGTGCTGGC CAGCAACTGC TGGGTAAAGA AATTTGGCGG TATCTACAGC
TGGGCGCGGC AGGGTCCGAG CCGGGCGTTC TTCGACCACG CGAACTGGGC GGTGGACGGG
GTACACCGAT GGGCACTGCA CGTCAACCGT TCAGAGTTCG ACCAGATCCT GCTGGAACAC
GCCCGGGACC TCGGAGTCGA CGTCACCACC GGGATGGCCG TCACCGACTT CGCCGCGGCC
GCCAACGGCT GTCAGGTGAC GCTCGCCGAC GGCACCGCCG TGTCCGGCGC GTACTTCGTC
GATGCCTCCG GTCGCCAGCA GAGCCTGGTC ACCAAAAAGC CTCGCGAATG GTTGTCCGGC
TACCGGAACA TCGCGATCTG GCAGCACTAC CTGGGCGGCC TGCCGGCCCA GGGGCTCGAC
GGCGACTGGA ACATCTTCCG GGAGAAGAAC CTCTCGCCGA TCGGTTGCTT CGCGTTTCCC
GACGGCTGGT GCTGGTACAT CCCCGTCCCC AGGATCGTGA ACGGGGAGCG GGTGCTCACG
CACTCGATCG GCATCGTGAC GAGCCCGGAG GTGCTGAAGG AACCCGGGAA GGACTTCACG
GACTCCGAGG TCTTTCTGCG CACCGTCCGC GGCGTGCCGC GGCTGGCCGA CCTGGTGGCC
GAGGTGACGC CGATCTCCGA CCAGATGATG ACGGTCACCA ACTACTCCCG GGTCAATGAG
CGCTTCGCCG ACCTCGACCG GCACTGGATC CTGATCGGCG ATGCCTCCTA CTTCGTGGAC
CCCCTCTTCT CGTCCGGGGT CGCGTTCGCG GCCAACCAGG CGGCCTCGGC GGCGCTGCTG
CTGCGCACCA CGCTGCGCGC CGAACTCTCC CCGGGTCTGG TGAGAGATCT GTGGCAGGAC
TACGACCACG AGTGGCACGG AATGGCCGAG GTGTTCGCGC TCTCGATTGA TCAGTGGTAT
CACATGATCG GCGCGGACAA CCCGGGCAGC GCATACTGGC ACCGGCGCAA TTCGAGTCCG
CATCTGGACA TGCCCGATCG GTCCTTCGAC GCGCTGCTCA ACACGGCGTT CACCCCCGAC
CTGCTCCTGA TCATGACGCG CGGCACCGGC CGAATGTCCG ACCTGGCGAT CGACGGTCCG
TACCAGCAGG CTCGCGCGCA CGTGATGCTG ACGGAGCCGG AACCGGACGC CGTGCTGGTC
GCCGCTCCCG GAGTCCGGAT GCGGGCCGGC GTGGCGCTGG ATGTCCCCGG CTTCAAAGCG
GTGCTCCCAC CGGCCGACCT CGAACTGGAC ACGCCCGCCG CGGTACGGGC CGCCGTCGCC
GAGTACTGGA CCGATCCGGT GGCGGCCGAG GCGAACGGCG GCCTCGGCGT GCCTTCTCCG
ACCGCCTCCC CAGTACCGTG CCACCGTTTC GAGTTCGATT CCGACGCACT CGTCGATTCC
GGCTCAGGCG CGACGGGGTT TTCGGTGCGC GGGGTGGACA GTCACGACGG CGCACCGCAG
CTGTGGGAGA TACTCAGCCG CGGTCCGGTC GTCTACGGCG AGCTCGGCTC GCGGCTCGCC
CCCGGTCAGC GGGTGCTGCT GCAGCGGCTG ATAAAGGCCG GAATGGTCAC CGTCAAGACC
GCGGCGAGGC AGACCACGCC GGGGACTGAA GCCGCCGAGG TCACCCTCGC GGACTGA
 
Protein sequence
MRDHYDVIIA GGGPAGSTLA ALLARTSDLK VAIFEKDEFP REHIGESFAH PLIPVLAESG 
ALAKVLASNC WVKKFGGIYS WARQGPSRAF FDHANWAVDG VHRWALHVNR SEFDQILLEH
ARDLGVDVTT GMAVTDFAAA ANGCQVTLAD GTAVSGAYFV DASGRQQSLV TKKPREWLSG
YRNIAIWQHY LGGLPAQGLD GDWNIFREKN LSPIGCFAFP DGWCWYIPVP RIVNGERVLT
HSIGIVTSPE VLKEPGKDFT DSEVFLRTVR GVPRLADLVA EVTPISDQMM TVTNYSRVNE
RFADLDRHWI LIGDASYFVD PLFSSGVAFA ANQAASAALL LRTTLRAELS PGLVRDLWQD
YDHEWHGMAE VFALSIDQWY HMIGADNPGS AYWHRRNSSP HLDMPDRSFD ALLNTAFTPD
LLLIMTRGTG RMSDLAIDGP YQQARAHVML TEPEPDAVLV AAPGVRMRAG VALDVPGFKA
VLPPADLELD TPAAVRAAVA EYWTDPVAAE ANGGLGVPSP TASPVPCHRF EFDSDALVDS
GSGATGFSVR GVDSHDGAPQ LWEILSRGPV VYGELGSRLA PGQRVLLQRL IKAGMVTVKT
AARQTTPGTE AAEVTLAD