Gene Francci3_3163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3163 
Symbol 
ID3903885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3741845 
End bp3743194 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content73% 
IMG OID637880484 
ProductHAD family hydrolase 
Protein accessionYP_482249 
Protein GI86741849 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0647] Predicted sugar phosphatases of the HAD superfamily 
TIGRFAM ID[TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0117236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.303399 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGGC CGGCCGCCGT GCCCGCGCCG GCGCACGAGA CCACGTCAAC CAGCCCTTCG 
TCAGTCAGCC CTTCCGCGGT CTCCTCCGCT TCGTCGTCCT GCGGCTTGTC CGGTTCCTCT
GTGTCCTCTG TGTCCTCTGG TTTCTCCGTG TCTTCCGGCG GCTTCCCCGG GCCGTTGCGC
GGGACGGACC GGCCGCTGGC CGACCTGTTC GACGTCGCCC TGATGGATCT CGACGGTGTC
GTCAACCGTG GGGCCGCCGC CGTGCCACAC GCGGCCGGCA CCATCGCAGC CGCGGGCCGC
CGGGGGATGC GCACGGTGTA CGTCACGAAC AACGCACTGC GCCCGCCGGC CGAGGTCGCC
GCCCGCCTGC GCGGCTTCGG CGTGCCGGCG CAAACCGAGG ACGTCGTCAC CTCGGCGCAG
GCAGCGGCGC ACGTCCTGGC CGAACGGCTG GGCACCGGAT CCCGGGTGCT CATTACCGGA
GGGCGGGGAC TTCGACAGGC GGTGATGGAG GAGGGCCTGG TCCCGGTGGA CTCGGCCGAG
GACGATCCGG CGGCGGTGGT CCAGGGGTTC GACCCGGATC TCACCTATGC CCGCCTCGCC
GAGGCGGCCT ATGCCATCCG GGCCGGAGCA CTGTGGATCG CCAGCAACGC CGATCGCACC
GTGCCAACCG AGCGGGGCGT CGCGCCTGGT AACGGATCCG TCATCGCCTT CCTGCGGGCC
GCTACGGACC GCGAGCCGGT GGTGACCGGC AAGCCCGAGT CGGCGATGCA CCGCGAGTCG
ATGCGGCGCA GCGGAGCCCG GATACCCCTC ATCGTCGGTG ACCGGCTCGA CACCGACATC
GAGGCCGGTC ACCGGACGTC GACGCCGACC CTGCTCGTCT TCACCGGGGT GACGACCCCC
GGAGACCTGC TCGCAGCTCC CGCTCCGCAC CGTCCCGACT TCCTCGCCGC GGACCTGCGC
GGGCTGCTCC GGGCGGCGCC ACCGGTGGAG GCCGTCCCGG AGCTGGGGAA CCACGCCTAT
CGTTGCGGCG CCTGGAGCAG CCGGGTCGAG GAGGGGACGC TGCACTGGTC GGGTGAAAGC
TACGGGCTGG GCGATGACGC GTCGGATGGC GATCGTTCAG GCGGCGGTCA TTCCGGTGGG
GATGCGTCAG ACCGGGATAT GTCGGGTGAC GACACGTGGA ACGGGGACGG CACCGACGGG
CTGGACGGGC TGCGGGCCGC CTGCGCCACG GTCTGGTCCG CCCTGGACAG CGGCGTCGCC
GTCCACGCGA TCGCGGCGCG ACGCCCACCG GGTTGCGAGG ACCTGATGGC TCCGGCGCCC
CGGGCACCGG AGCCCGTGGC GCTCGCCTGA
 
Protein sequence
MTRPAAVPAP AHETTSTSPS SVSPSAVSSA SSSCGLSGSS VSSVSSGFSV SSGGFPGPLR 
GTDRPLADLF DVALMDLDGV VNRGAAAVPH AAGTIAAAGR RGMRTVYVTN NALRPPAEVA
ARLRGFGVPA QTEDVVTSAQ AAAHVLAERL GTGSRVLITG GRGLRQAVME EGLVPVDSAE
DDPAAVVQGF DPDLTYARLA EAAYAIRAGA LWIASNADRT VPTERGVAPG NGSVIAFLRA
ATDREPVVTG KPESAMHRES MRRSGARIPL IVGDRLDTDI EAGHRTSTPT LLVFTGVTTP
GDLLAAPAPH RPDFLAADLR GLLRAAPPVE AVPELGNHAY RCGAWSSRVE EGTLHWSGES
YGLGDDASDG DRSGGGHSGG DASDRDMSGD DTWNGDGTDG LDGLRAACAT VWSALDSGVA
VHAIAARRPP GCEDLMAPAP RAPEPVALA