Gene Francci3_3166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3166 
Symbol 
ID3903888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3745624 
End bp3747588 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content76% 
IMG OID637880487 
Producttetratricopeptide TPR_2 
Protein accessionYP_482252 
Protein GI86741852 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.307175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.31292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGGG AAGACAGATC CAGTCAGCGA GGCGGGACGA CGCCGTTCAC GTCCCCACAC 
CCGCAGCGGC GTGATCGGGA CGCTTCAGGT GATCGTGGCA CGCCGGCAGA GGGACGTCGC
GCCGGGCGCC CGGTGCGTGG CGATGCCCCG GGCGTAAGGC GGGGCGGCGG ATCCGAGACC
GGCGCGTCGC GCGGACAGGG GGGTCCCTCT CAGGGCGCGC GCCAGGGTGC GCGCGGCGGC
GGGCAGGGCG GTGCGCGGGG AGGAACGCGG GGCGGGGCGG GCCAGTTCCG CCGCGGCCCG
GAGGGCCAGG GCCGGCCGGA CCGCTCCAGG GGTTCGGACT CCAACACCCG GTACGGCAAC
GCCCGGTACG GCGCATCGCC GCGTGCGGGC GATGCGGGAT CCGCCACTTC CTCTTCCTCC
TCGGCCGCCT CGTTCGTCGA CCGCCGCCCG GGTGGGGCTC GGCGTCCGGA GGGGACGGGG
ACCGGCGGTG GGCGTGCTCC GGCGGCTGGC GGTGAGCGCT TTTCCCGAGC CGGCGGCCCG
CGCGGTGACC GGGGCGCGAG CGGTGACCGG GGCGCGAGCG GAGCCTCCGC CCGCAACCGG
GTCGCCGCCG AGGCGGCTTG GCGCGAGCGT CGGGGTCGGC CGGGAGCGGA GGAGACCGGC
CGGTCCGGAA GGCCGAGTAG CCGTCCCGAC CACGGCAGAC CGGGGCGCGA CTCCTCGGTA
TCCCATCGGC CCGGGCAGGC CCGGGCCGGC AGCGACGGCC CGCGGCCCCG GCACGACGGC
CCTGAACGGC CGACTCACGG CAGAGTCTGG CGTTCCGATG GCGACCGTCC CGCGGGTACG
CGCTCGGCCG GCCCGCGGCC CGGCGGCGAT CGAGGCCGGG AGTCCCGGGG GCCCCGGGAC
ACCAGCGGTG GAGGGGACCG CCGCGACGGC GCCGCTCGCG CCGAGCGCCG TGGCCGGCCG
TTCACCGAGG CTTCCAGGCC CGGGGGCGGC GGCCCCCGCG GCGGCCCCCG CGGCGGCCGT
CCGACCGGGC CTGGCCGGAC TGACAGAACC GACCGGGCTG GCCGGACTGA TCAGACTGAG
CAGGGCAGCC GGCGGGCCCG GGTGCCGGCT CCGCCACTTC CCGACGAGGC CCGCGCGGAC
CTGCTGGACC ACGACGTGCG CCGGTCCCTT CGGGGGCTGC CATCGTCGCT CGCCGACACG
ATCGCCCGCC ATCTCGTCGC GACCGCCCTG CTGCTCGACG ACGATCCCGC CCGCGCGCTC
GCCCACGCCC GCGCCGCCGC CGATCGGGTG CCGCGCCTGC CTGCGGTGCG CGAGGCGGTC
GGTATCGCGG CCTACCATGC GGGGGAGTAC GCGACGGCCC TGGTGGAACT TCGGGCCGCT
CGTCGGATGG ACGGTTCCGC GCACAACCTG CCGCTGATGG CGGACAGCGA ACGGGGACTG
GGCCGGCCGG AGCGTACCGT CGCCTACCTG CGGGATCCGC AGATCGACCA GCTTGATCCG
GCCGCTCGGG CGGAGATGCT CATCGTCGTC TCCGGGGCCC GCCGGGACCT CGGTCAGCCG
GAGGCCGCCG TCGTGTTGCT GCGGGATCTG GCCACGGCCA AGACGCCGCC CGCGCCGTGG
ACCGCCCGCC TCTGGTACGC CTATGCCGAG GCGCTGCTTG CCGCCGGTCG TCCAGAGGAG
GCCGCACGCT GGTTCACCTC CACGGCTGCG ATCGACGAGG GAGAGACCGA CGCCGACGCC
CGTGCGTATC TGATCATGAC TGGTCGGCCT CTCCCCGAGG AGGACGAGGA GGCTGCCCAG
GACGGCGCGG ATCTCCTGCT CTCGGATCGG GCCGATCCCG ACGCGGACGC CGGGACCGAG
GCGAACGTCG ATCAGGACAT CACCGGCCAT GATCTTACTG ATCATGGCAC CGGCACCGGC
ACCGAGGCTG CTCCCGAGGC CGAGGCGACG GAGACCGGGA GGTAG
 
Protein sequence
MPREDRSSQR GGTTPFTSPH PQRRDRDASG DRGTPAEGRR AGRPVRGDAP GVRRGGGSET 
GASRGQGGPS QGARQGARGG GQGGARGGTR GGAGQFRRGP EGQGRPDRSR GSDSNTRYGN
ARYGASPRAG DAGSATSSSS SAASFVDRRP GGARRPEGTG TGGGRAPAAG GERFSRAGGP
RGDRGASGDR GASGASARNR VAAEAAWRER RGRPGAEETG RSGRPSSRPD HGRPGRDSSV
SHRPGQARAG SDGPRPRHDG PERPTHGRVW RSDGDRPAGT RSAGPRPGGD RGRESRGPRD
TSGGGDRRDG AARAERRGRP FTEASRPGGG GPRGGPRGGR PTGPGRTDRT DRAGRTDQTE
QGSRRARVPA PPLPDEARAD LLDHDVRRSL RGLPSSLADT IARHLVATAL LLDDDPARAL
AHARAAADRV PRLPAVREAV GIAAYHAGEY ATALVELRAA RRMDGSAHNL PLMADSERGL
GRPERTVAYL RDPQIDQLDP AARAEMLIVV SGARRDLGQP EAAVVLLRDL ATAKTPPAPW
TARLWYAYAE ALLAAGRPEE AARWFTSTAA IDEGETDADA RAYLIMTGRP LPEEDEEAAQ
DGADLLLSDR ADPDADAGTE ANVDQDITGH DLTDHGTGTG TEAAPEAEAT ETGR