Gene Francci3_2498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2498 
Symbol 
ID3904876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2949056 
End bp2950384 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content75% 
IMG OID637879828 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_481594 
Protein GI86741194 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.486253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGCG ACGCGCCGAC CGACGTGCCG ACCGGAAGCC CGGGAGCCGC GCCGGTCGAC 
GAGCGCGGAC GGCCGGGGCC GGACCGCCGT CGGCTGCTGG GCGGCGCCGG GCTGCTCGGC
CTCGGTGGCG GAGTCGTCCT CGGCACGGGG GTCGCCCTGG GCACGCAACG CCTGCTACCG
GACCGGCCGA ACCCGGTCGA GGCGGCTCGC CGCGCCGCGG TGGCCGCCGT GGCCGGTGAC
GGCCGTCACC AGCCCGGGAT CGCCGAACGG GCCCCCGCGC ATCTGATCTT CACCGCCTAC
GACCTGATCT CGACCGAGCC GACGGCGGTC CGATCCGCCC TCGCCGCGCT GCTGCGGACC
TGGACGTCGG CCTCGGCGGT GTTGATGCGC GGCGAACCGA CCGCCGGCGC CGAGCGGGAC
ACGGCGGGGC TCGGGCCGGC CGCCTTGACG GTCACCGTCG GGCTCGGCGC GTCAGCGTTG
CGCCGCGCCG GCCTCGACAC GCGGATTCCC GGTCCGCTCG CGGACATCCC TGCCCTGCCC
GGCGACCGGA TCGACGCGGC CCGTGGCGGC GGGGACCTCG CCGTGCAGGT CTGCGCGGAG
GACCCGATGG TCGCTTACTC GGCGGCCCGC CAACTGCGTC GCATCGCCGC TGCGTACGTC
CAGCCCCGCT GGGTGCAGCG AGGGTTCCAG CGCACCGCGG CCGGTGCCGC CGACCCCGAT
GCCACCCCGC GTAACCTGAT GGGGCAGGTC GACGGCACCG ACAATCCCCG GCCGGGCACG
GCACAGTTCG ACCTCGCGGT GTGGGCCGCC GACGGGCCGG CATGGATGCG AGGCGGGACC
TACGTCGTCT GCCGGCGGAT CCGCATGCTG CTCGACAGCT GGGACCGGCT GACCGAGGCG
GCGCAGAGCG AGGTCATCGG CCGGCGCAAG TCCGATGGCG CACCGCTGTC CGCCCCGCCC
GCCGACCAGG GCGGGGGCGA GACGACAGCG CCGGACTTCG CCGCCCGCAC TGCCGACGGC
CGGCCCGCGA TCGCCGTCAA CGCCCACATC CGACTGGCCC ATCCGCAGTT CCACGGCGGC
GTCGCGATGT TCCGCCGCGG CTACTCCTAC GACGACGGCC TCGACGCCAC CGGTGAACCG
GACGCGGGGC TGTTCTTCCA AGCCTACCAG GCCGACCCAC GTACCGCCTT CGTTCCCGTC
CAGCGCGCCC TCGCCGCCTC CGACGCGTTG AGCACCTTCA TCCGGCATAC CTCCAACGCC
CTGTTCGCCA TCCCACCCGC CCCGCCGCCG GGCGGCTTCC TCGCTCAGCA GCTCCTCGAT
GGCGCATGA
 
Protein sequence
MTGDAPTDVP TGSPGAAPVD ERGRPGPDRR RLLGGAGLLG LGGGVVLGTG VALGTQRLLP 
DRPNPVEAAR RAAVAAVAGD GRHQPGIAER APAHLIFTAY DLISTEPTAV RSALAALLRT
WTSASAVLMR GEPTAGAERD TAGLGPAALT VTVGLGASAL RRAGLDTRIP GPLADIPALP
GDRIDAARGG GDLAVQVCAE DPMVAYSAAR QLRRIAAAYV QPRWVQRGFQ RTAAGAADPD
ATPRNLMGQV DGTDNPRPGT AQFDLAVWAA DGPAWMRGGT YVVCRRIRML LDSWDRLTEA
AQSEVIGRRK SDGAPLSAPP ADQGGGETTA PDFAARTADG RPAIAVNAHI RLAHPQFHGG
VAMFRRGYSY DDGLDATGEP DAGLFFQAYQ ADPRTAFVPV QRALAASDAL STFIRHTSNA
LFAIPPAPPP GGFLAQQLLD GA