Gene Francci3_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2017 
Symbol 
ID3906733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2369159 
End bp2370286 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content66% 
IMG OID637879353 
Producthypothetical protein 
Protein accessionYP_481120 
Protein GI86740720 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.971406 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACAAGA TCCAAACAAT CGAGGAGCTA CACACCACGG CATCCGATCT CACCGGACTC 
ACCGACTTCG GCCCGGCGGA CTATCTGGAA GGGCTCGAGG TGCTCCTGGC CTCCTATCAG
GAAGAGGCGA GCCTGACGCC GCACGGCGTC CAGCTGGTCC AGGACGAGTT GTGCGGCATC
CTGATGGCCC GCCTGTTCAG TGAGGCCGGG TGGCAGAGAC ATCCCGAGCA CGCCCAGGTC
CCGATCGAGA GGCCGGTGTT CATCGTCGGT ATGCCGCGGA CCGGCACCAC CACCCTGCAC
CGGCTACTCA CCGCCGACTC GGCCAACCAA GGGCTCGAAC TGTGGCTGGG CTACGCTCCC
CAGCCCCGGC CCGCACGCTC GACCTGGCCT ACCAACCCGA TCTTCCAGAT GGTCCAGGGG
GGCGTCGACA AGTTTGTCGA GCAACATCCC GGCTACCTGG GCATCCACAA TCGCAAAGCC
GGCGAGGTCG AGGAGTGCTG GCTGTTGACC CGCCAGTCGA TGGTCTCGCC GTATTTCGAG
TTCACCGGGT ACGTGCCGAC GTACTCCGCG TGGCTGGCAG GCCGGGACTC GACCGAAGCG
TACCGGCGGC ATCGCCGCAA CCTCCAACTG ATCGGGCTGC ATGACCCCGG GCGACGCTGG
GTACTCAAGT CCTCCAGTCA CATGCCCTGT CTGGATGCCC TGCTGGCGAC CTACCCCGAC
GCAATGGTGA TCCAGACCCA CCGCAGACCC GCTAGTACGG TACTGGGATC GGCGTGCAGC
ATGGCGAGCA AGCTTGCCGC GGGCATGTCC TCAGTTTTCC AGGGCGAGGT GATCGGCCCT
ACCCTGCTCG CACTCGCCAC TCGCACTCTT GCGCGATTCG CCACGGAGCG GGCCAAACAC
GACCAGGCCA GATTCTACGA CGTCGAATTC GACGAGTTCA CCGCCGACCC GCTGGCGGTC
GTCGCCGACA TCTACCGCCA CCTCGGATGG GACCTGGCCA ACGAGGTCCG CCCGGCCATG
TCGGCCGTGC TGGCCGAAGA CGCACGCCTC CGTTCGCACC GCTACGACCT GGCCCAATTC
GGCATCAGCG CCGAGGAAGT CGACTCCCGA CTCGGCACCC TGCTCTGA
 
Protein sequence
MDKIQTIEEL HTTASDLTGL TDFGPADYLE GLEVLLASYQ EEASLTPHGV QLVQDELCGI 
LMARLFSEAG WQRHPEHAQV PIERPVFIVG MPRTGTTTLH RLLTADSANQ GLELWLGYAP
QPRPARSTWP TNPIFQMVQG GVDKFVEQHP GYLGIHNRKA GEVEECWLLT RQSMVSPYFE
FTGYVPTYSA WLAGRDSTEA YRRHRRNLQL IGLHDPGRRW VLKSSSHMPC LDALLATYPD
AMVIQTHRRP ASTVLGSACS MASKLAAGMS SVFQGEVIGP TLLALATRTL ARFATERAKH
DQARFYDVEF DEFTADPLAV VADIYRHLGW DLANEVRPAM SAVLAEDARL RSHRYDLAQF
GISAEEVDSR LGTLL