Gene Francci3_0431 details

Gene Information

Locus tag: Francci3_0431
Symbol:
ID: 3903620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 513784
End bp: 514575
Gene Length: 792 bp
Protein Length: 263 aa
Translation table: 11
GC content: 70%
IMG OID: 637877763
Product: 3-amino-5-hydroxybenzoic acid synthesis related
Protein accession: YP_479547
Protein GI: 86739147
COG category: [R] General function prediction only
COG ID: [COG0546] Predicted phosphatases
TIGRFAM ID: [TIGR01454] 3-amino-5-hydroxybenzoic acid synthesis related protein
    [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
    [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
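The coordinate, length, and GC fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical, chosen for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


length = gene_length(513784, 514575)
print(length, protein_length(length))  # 792 263
```

Passing the full gene sequence from the Sequence section to `gc_percent` should give a value that can be compared with the reported GC content.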


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGAACTG AGCTGAAACC CTGGTCGGCC ATGACCACCA CCGGGACCGG TGCGGGGACG 
ACCATCGGGA CCGGTGCGGC GTCCGGAACA CGGAAACGTG CGGTCATCTT CGATCTCGAC
GGGGTGCTGG TCGACAGCTT CGGAGTGATG CGGCAGGCGT TCACGATCGC CTACGCCGAG
GTCGTGGGTC CCGGCGAGCC CCCGTTCGAG CAGTACAACC GGCACCTGGG ACGCTACTTC
CCGGACATCA TGCGGATTAT GAATTTGCCG CTGGAGATGG AGGAGCCGTT CGTCCGCGAG
AGCTATCGGC TGGCCCACCA GGTGGTGCTC TTCGAAGGGG TCGCCGCGCT GCTGCGGAGC
CTGCACGAGC GCGGCCTCGG GCTCGCGGTC GCCACCGGCA AGGCGGGTCC ACGGGCCCGG
CACCTGCTGG GTGAGCTCGA CATTCTGCCG ATCTTCGATT ATGTCATCGG CTCTGATGAG
ATCGCCCGTC CCAAGCCGGC GCCGGACATC GTGAACCGGG CACTTGAGCT GTTGGGCGTG
GCGCCGAATG AGGCGATGAT GATCGGGGAC GCGGTCGCCG ACCTGGAGAG CGCGCGTGCG
GCCGGCGTGA TGGCGGTGGC CGCGCTGTGG GGCGAGGCGG ACGGCGCCGA GCTGGTCGCG
GCCGGGCCGG ACGCGGTGCT GCGCCGGCCC GCGGAGTTGC TGTCCCTTCT CCGGCCGCAC
CCGGTCGAAC CCGCCGGCAC CGCCGGTGTC CCCGCGGCTC AGGCCAGCCG GCCGATCCTC
GCCGGCGACT GA
 
Protein sequence
MGTELKPWSA MTTTGTGAGT TIGTGAASGT RKRAVIFDLD GVLVDSFGVM RQAFTIAYAE 
VVGPGEPPFE QYNRHLGRYF PDIMRIMNLP LEMEEPFVRE SYRLAHQVVL FEGVAALLRS
LHERGLGLAV ATGKAGPRAR HLLGELDILP IFDYVIGSDE IARPKPAPDI VNRALELLGV
APNEAMMIGD AVADLESARA AGVMAVAALW GEADGAELVA AGPDAVLRRP AELLSLLRPH
PVEPAGTAGV PAAQASRPIL AGD
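
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (bacterial, archaeal, and plant plastid code), GTG can act as an alternative start codon and is then read as methionine. The sketch below illustrates this with a minimal translator. It is an illustration, not the database's own method: the `translate_cds` name is hypothetical, the codon assignments are the standard ones (which table 11 shares), and `START_CODONS` lists only a common subset of the table 11 initiators.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: a common subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"  # alternative start codons initiate with Met
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGGAACTG AGCTGAAACC CTGGTCGGCC"))  # MGTELKPWSA
```

The printed residues match the first ten residues of the protein sequence, and the final codons of the gene (GCC GGC GAC TGA) translate to AGD followed by a stop.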