Gene Francci3_3683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3683 
Symbol 
ID3905367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4419288 
End bp4420277 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content75% 
IMG OID637881009 
Productthioredoxin-related 
Protein accessionYP_482764 
Protein GI86742364 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3118] Thioredoxin domain-containing protein 
TIGRFAM ID[TIGR01068] thioredoxin 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.418332 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.641891 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGTA GGCCACCAGT CCCATCCAGA TCGCCGCGCG GTGCGGGAGG CGCGGGACGT 
CTGCCCGAAA GCCTCGGCGG TCTGCGCCTC GCCGGCGCCG TCCCGCTCGA CCCCAAGCCG
GCGACCCCGC CTCCCCCCGC CAGCAGCGCC GCCACGGCCG CGCCGGACGC CCGTCCCGGC
GCGCCAGCCG GGCCCGTGGT CATCGACGTC ACCGAGGCGA GCTTCGCCAC CGACGTGGTC
AGCCGGTCGA TGCAGGTTCC CGTAGTGCTG GACTTCTGGG CCTCCTGGTG CGGCCCGTGC
AAGCAGCTCA GCCCGATCCT GGAGAAGCTC GCCGCGGCGG ACGGCGGCCG GTGGATCCTC
GCCAAGATCG ACGTCGACGC GAACCCGGGC CTCGCGCAGG CCGCCGACGT GCAGGGCATC
CCCGCGGTGA AGGCGGTGAT CGGCGGGCGG ATCATCGGAG AGTTCACCGG CGCCGTGCCC
GAACGGGAGG TGCGGAGCTG GCTCGATCAG CTGCTCTCCC TGGTCGGCGA GGCGATGGGC
GCAGTCCCGG GCGCGGCAGC CGCGGGCGGC CCCGCCCGTG ACCCACATGT GGCGGCGGCC
GAGGCGGCGC TTGCCCGTGG CGACTTCGAC ACCGCGGTCG AGTCCTACCG CATCCGGCTC
ACCGAGGCGC CCGCGGATCC CGAGGCCCTG ACCGGGCTCG CGCGGGCCGA ACTGCTGCGC
CGGGTGCATG GGTACGATCC GGTGGACGTC CGCAACCGGC TGGTGGCGAA TCCGGACGAC
GTCGAGGCCG CGGTTGCCGC GGCCGACCTG GGCATCGCGC AGGGAGACGT GGCCGGGGCC
CTCGCTGGTC TCGTCGAGGT CGTCCGCCGG ACGGCGGGAC CGGAGCGGGA AAGGGTCCGA
GCGCACCTCG TCGGGCTGTT CCAGGCGTTG GGTGACGAGG AGCCCGCGGT GGCCCCGGCT
CGACGCTCCC TGGCAGCCGC TCTCTTCTGA
 
Protein sequence
MQRRPPVPSR SPRGAGGAGR LPESLGGLRL AGAVPLDPKP ATPPPPASSA ATAAPDARPG 
APAGPVVIDV TEASFATDVV SRSMQVPVVL DFWASWCGPC KQLSPILEKL AAADGGRWIL
AKIDVDANPG LAQAADVQGI PAVKAVIGGR IIGEFTGAVP EREVRSWLDQ LLSLVGEAMG
AVPGAAAAGG PARDPHVAAA EAALARGDFD TAVESYRIRL TEAPADPEAL TGLARAELLR
RVHGYDPVDV RNRLVANPDD VEAAVAAADL GIAQGDVAGA LAGLVEVVRR TAGPERERVR
AHLVGLFQAL GDEEPAVAPA RRSLAAALF