Gene Francci3_2189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2189 
Symbol 
ID3906789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2562072 
End bp2563646 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content69% 
IMG OID637879521 
Productsulfatase 
Protein accessionYP_481287 
Protein GI86740887 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.2172 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGT GGCGACGAGA GCACCACGAC TCCGCTTCCC GACGCCGTCC CCGCTCTCGA 
CGGCGGCTGG TGGCGGCCAC GGCCACGGCC ACGGCCTCGC TGGCGCTGGC GGCGGGCGCC
TGCGGCGGTT CGGCCGCGCC GGCCACAACG GCGAGCGCGG CCCGCCCGAA CATCGTCTTC
ATCCTCACGG ACGACCTGTC CTGGAACCTG GTCACCGACC AGATCGCGCC GCACATCACC
GCCCTGGAAA GGCAGGGCGA GACGTTCGAC CACTACTTCG TGACCGACTC GCTGTGCTGC
CCGTCCCGGT CGTCGATCTT CACCGGCCTC CTCCCTCACG ACACCAAGGT CGAGACCAAC
CTGTCCCCTG ACGGCGGCTA CGGGAAGTTC CAACAGGAGG GCCTCGCCGG CAGGACCTTC
GCCGTCGCGC TGCAGGCGGC GGGATACCAG ACCTCGATGC TCGGCAAGTA CCTCAACGGC
TACGGCGACC CCACCATCAC CCCGACCACC GGACCCGTCC CGCGCGGCTG GTCCGACTGG
CACGTCAGCA ACACCACCGG CTACGCGGAG CTCAACTTCG ACCAGAACGA CAACGGCGTC
GTGCGCCACT ACGCCGGCCA GGACAACTAC GGCGTGGACG TGCTCAACGC CGACGCCCAG
GCGTTCATCC GACGCTCGGC CGGGAAGCCG TTCGCCCTGG AGGTGGCTAC CTACGCCCCC
CACCAGCCGT ACACACCGGC GCCGCGCAAC GCGGACGACT TCCCCGGCCT GACCGAGCCA
CGCGACCCGT CCTTCAACAC CAACAACACC GACGCACCGG CCTGGCTCGG CCAACGCGCC
CCTCTCGCCC CGTCAGTGCT CACGAACCTG GACCAGGCCT ACCGCGAGCG TGCCCAGGCC
GTCGAGTCCG TCGACAAACT GGTAGGCGAC ACGGAGGCGA CGCTCGCCGC CGAGCACCTG
CTCGACAACA CCTACTTCGT CTTCAGCTCC GACAACGGCT ACCACCTCGG CCAGCACCGC
CTCGTCAGAG GCAAGCAGAC CGCCTTCGAC ACCGACATCC GCGTGCCCCT GATCGTCACC
GGGCCCGGCG TCCCCCACGG CCGGGTGATC TCCCAGGTCG CTCAGAACGT CGACCTTTAC
CCCACCTTCA CCGACCTCGC CGGCGCCACC CCGGCCAGGC CCGTTGACGG GCGCAGTCTC
GTCCCGCTGC TGCGCCCCGC GACGGAGCCG CCATCCTGGC GCACGATCGC GCTCGTCGAA
CACTTCGGCC AGGCGAGCGA CCCCGCCGAC CCCGACCACG AACCCGGCGG CAGCAACCCG
ACGACCTACG AGGCGATCCG GATCTCAGCG CCACACCTCG CGCACTTCGA CGGGCCAGTC
GAAGCGGTCT ACGTGGAGTA TAACGACTCC AAACACGAGA TCGAGTACTA CGACATCACG
AAAGACCCCT ACGAGATCAA CAACGTCGCG GGCGCGCTCA CCGGGGCGCA GCGCGCCGAA
CTACACACGG TCCTTGCCGG CCTCGGAAAC TGCCACACCC AGGCCGCCTG CGCCGCCGCC
GGTCTGCCGA TATGA
 
Protein sequence
MKLWRREHHD SASRRRPRSR RRLVAATATA TASLALAAGA CGGSAAPATT ASAARPNIVF 
ILTDDLSWNL VTDQIAPHIT ALERQGETFD HYFVTDSLCC PSRSSIFTGL LPHDTKVETN
LSPDGGYGKF QQEGLAGRTF AVALQAAGYQ TSMLGKYLNG YGDPTITPTT GPVPRGWSDW
HVSNTTGYAE LNFDQNDNGV VRHYAGQDNY GVDVLNADAQ AFIRRSAGKP FALEVATYAP
HQPYTPAPRN ADDFPGLTEP RDPSFNTNNT DAPAWLGQRA PLAPSVLTNL DQAYRERAQA
VESVDKLVGD TEATLAAEHL LDNTYFVFSS DNGYHLGQHR LVRGKQTAFD TDIRVPLIVT
GPGVPHGRVI SQVAQNVDLY PTFTDLAGAT PARPVDGRSL VPLLRPATEP PSWRTIALVE
HFGQASDPAD PDHEPGGSNP TTYEAIRISA PHLAHFDGPV EAVYVEYNDS KHEIEYYDIT
KDPYEINNVA GALTGAQRAE LHTVLAGLGN CHTQAACAAA GLPI