Gene Francci3_3531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3531 
Symbol 
ID3904470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4219563 
End bp4220846 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content71% 
IMG OID637880852 
Producthypothetical protein 
Protein accessionYP_482612 
Protein GI86742212 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0966276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.218294 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCTACA CCGACCCGTA CACGCCACCG CCCGAGCACC ATCCCGCACC CGGCGGGGGT 
CAATCCCTGG CAGACGGGGC CGCCGGGCTG GCGCTGCTGC ACATCGCCTA TGCCCGCGCC
GGGATCGGAG ACTGGGCCAC CGCCCACCAA TGGGTCAAGG CCATGACCGC CGAACCCGTG
GCCGCGGACA CGGGCGCCGG CCTGTACCGG GGCGCCCCGG CGGTCGCATT CGTGCTGCGC
ACCGCCGGCC AGCCCGTCTA CACCGGCGCA CTGCACACCC TCGACGAGCA CATCGCCGCG
ATCACTTGCA CCCGCCTGGA GGCAGCGCAC GAGCGGATCG ACCGCGGCGA ACTGCCCGCG
CTGCGGGAGT TCGACCTGAT CAACGGGCTG ACCGGACTCG AATGGGTCAA CGCCGCCGCC
GACCAGCCGG GTCCGCAACG CCCGTCCTGG TGCTACGGCG CCCCCGGCCT GGCCCGCGCC
CGCCACCTGA CCGCCCAGGC ACTGGACACC CCGAACAGGG TGGCCGACGC CAAGGCCACG
CTCGTCGCGT GTCTCACCGA CGAGGCTCAA CTCGCCCAAC TCGGCGACGA CTCGCTGTGC
CACGGCTGGG CCGGGCTGGT GCACGTCTCC CGCCGGATAC TCGCCGACAC CGAACCCGGC
GGCGAATTCG CCGAGGTCCT GTCCCGGTTG GAACACCGCT GGCGTCACCG CCGCGCCCAG
GCGGCCCGAA AACTATCGGA GGTGAGGGGG ATGCTGGAAG GCGACGCCGG AATCGCGCTC
ACCGATCTGC CGCCGGGCAC CGGATGGGAC GCCTGCCTGC TCACCGTCCC GCCCACGGCC
GGACCAACGC ACTCGCCCGT ATCCGCAAGT ACACACACGA AGGAACCGGA TGACCAACGC
CACCAGCACC CGCCCCGAGG ACCTGCGCGA GCAGATGATC AGCAACATCC GCACCGCTGG
TCACCTGCGC TCCGAGCGCA TCGAGCAGGC GTTTCGGGCC GTTCCCCGGC ACCGGTTCGT
TCCCGCGGCC TCGGTCGAGG AGGCGTACGC CAACAAGGCG ATCACCATCA AGCCCGGCGC
AGACCGGCCC GCCAGTTGCA TCTCCGTGCC GACCGTGGTG GCGATGATGC TCGGTCAGCT
CGAACTGACC ACGCCCGCCG CGCCCTGGCC GAGACCAGCT ACGACCGAGT GCGGGTGGTC
ACCGGCGACG GCGCCATCGG CGACGCGGAC CACGCCCCCT ACGACAAGAT CATCGTTACG
GAACTGTTGA CCGGACTTTC CTGA
 
Protein sequence
MTYTDPYTPP PEHHPAPGGG QSLADGAAGL ALLHIAYARA GIGDWATAHQ WVKAMTAEPV 
AADTGAGLYR GAPAVAFVLR TAGQPVYTGA LHTLDEHIAA ITCTRLEAAH ERIDRGELPA
LREFDLINGL TGLEWVNAAA DQPGPQRPSW CYGAPGLARA RHLTAQALDT PNRVADAKAT
LVACLTDEAQ LAQLGDDSLC HGWAGLVHVS RRILADTEPG GEFAEVLSRL EHRWRHRRAQ
AARKLSEVRG MLEGDAGIAL TDLPPGTGWD ACLLTVPPTA GPTHSPVSAS THTKEPDDQR
HQHPPRGPAR ADDQQHPHRW SPALRAHRAG VSGRSPAPVR SRGLGRGGVR QQGDHHQARR
RPARQLHLRA DRGGDDARSA RTDHARRALA ETSYDRVRVV TGDGAIGDAD HAPYDKIIVT
ELLTGLS