Gene Franean1_6785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6785 
Symbol 
ID5675098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8267263 
End bp8268822 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content74% 
IMG OID641245634 
ProductCHAD domain-containing protein 
Protein accessionYP_001511025 
Protein GI158318517 
COG category[S] Function unknown 
COG ID[COG5607] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTACAC CGGCCCACGG GCATCGGGAG GTAGAGACCA AATTCGACGT GAGTTCGACG 
TTCGTCGTCC CGGTCCTGAC CGGTCTTGCC GGGGTCGCCT CGACCGCCGG GCCGACGGAG
GAGCATCTCG ACGCCGTCTA CTACGACACC GAAGATCTGC GGCTTGCCCG CAACCGCATC
ACGCTGCGAC GGCGCCAGGG CGGGCATGAC GCCGGCTGGC ATCTCAAGCT CCAGAGCTCG
GGGGCCGGCC GGGACGAGAT CAGCCGTCCG CTGGGCGCGA TCGAGCGGGA CCCCTCCGCC
GAGGGCACTG TCCCGGGCGA GTTCGCCGAC CTCGTGGCCG CCACCACCCG GGGCAGGCCC
CTGGCCCCCG TCGCGCGGGT GCGGACCGTC CGCCGCGCGA CCACTCTGCG CGCCCCCGAC
GGACGGGACC TGGCGGAACT AGCCGACGAC GAGGTCCACG CCCAGACACT GGGCACGTCG
ACGACGCTGT CCCGCTGGCG GGAGATCGAG GTCGAGGCCC TCGGCGACGA CCTCGACGTC
CTGCCGGCGG CCGGGGCGGT GCTGTGTGAG GCCGGGGCCC GGCCGGCCGC CGGGCCGTCC
AAGCTCGCCC GGGCACTCGG ATCGCGGGCG GCGCGGCCGG AACTCCCCGA GCTCCCCGCC
GACGGCGCCC CGGCCGGGGG AACCGCGGGT GAGGCCGTCC GCGGCTATCT GGCGACCCAT
ACCCGGGCCC TGCTGGCCGC TGACGCCCGC GTGCGCCTCG GCGATCCCGA GTCTGTTCAC
GACATGCGGG TCGCCGCCCG CCGGCTGCGC AGCGCCCTGC GCACGTTCCA GCGGCTGTTC
GACCCAGCAC CGGCGCGCGT GCTCCAGGCC CGGCTGCGGG AACTGAACCT CCTGCTCAAC
GCCGCCCGCG ACGGCGAGGT CCAGCTCGAG CGGTTCACCA CCGAGATCGA CGCGCTCGAC
GAACGGGACC TGCTGGGCCC CGTCGCCGCC CGCGTGCAGG GCCATCTGCG CGCACAGCAC
CTGCGCGGCC GGGAGCAGGC CCTGACCTGG ATGCGCGACG CGCAGTACCT GGATTTCCTC
GACGACCTGA TCGCCTTCGT CGTCGGACCA CCGTATTCCG CCCTTGGTCG CCGCCCGGCC
GGGCCGGCCC TGCGCTCCCC GATCCGCAAA GCCGACCGCA AGCTGCGCCG CCGGGTCGAT
CGAGCCCTCC GCACCCCCGC CGGCGACAGC CAGGACGTCG CCCTGCACGC CGCGCGGAAG
GCCGCGAAGC AGCTGCGCTA CGCGAGCGAG GCGGCCACGC CGGTCTACGG GGAACACGCC
GCGACGCACA CCAGGCGAGC CAAGAAAATC CAGAACAGCC TGGGTGAGCA CCAGGACTGC
GTCGTCGCCC AGGGCGTCCT GCGCGAGTTC GCGATCGCCG CCAACCAGGC CGGCGAATCC
TCGTTCACCT ACGGCCTTCT CCTCGGCGGC GAGCGGGAAC AGGCTCACCT GACCAGGGAT
GTCTTCGCCG CCCGCTGGCC GAAGCTCTCC CGCCGGCGCC ACCGCCGCTG GCTGCACTGA
 
Protein sequence
MRTPAHGHRE VETKFDVSST FVVPVLTGLA GVASTAGPTE EHLDAVYYDT EDLRLARNRI 
TLRRRQGGHD AGWHLKLQSS GAGRDEISRP LGAIERDPSA EGTVPGEFAD LVAATTRGRP
LAPVARVRTV RRATTLRAPD GRDLAELADD EVHAQTLGTS TTLSRWREIE VEALGDDLDV
LPAAGAVLCE AGARPAAGPS KLARALGSRA ARPELPELPA DGAPAGGTAG EAVRGYLATH
TRALLAADAR VRLGDPESVH DMRVAARRLR SALRTFQRLF DPAPARVLQA RLRELNLLLN
AARDGEVQLE RFTTEIDALD ERDLLGPVAA RVQGHLRAQH LRGREQALTW MRDAQYLDFL
DDLIAFVVGP PYSALGRRPA GPALRSPIRK ADRKLRRRVD RALRTPAGDS QDVALHAARK
AAKQLRYASE AATPVYGEHA ATHTRRAKKI QNSLGEHQDC VVAQGVLREF AIAANQAGES
SFTYGLLLGG EREQAHLTRD VFAARWPKLS RRRHRRWLH