Gene Franean1_4698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4698 
Symbol 
ID5673040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5612661 
End bp5613662 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content73% 
IMG OID641243555 
Productsucraseferredoxin family protein 
Protein accessionYP_001508971 
Protein GI158316463 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4759] Uncharacterized protein conserved in bacteria containing thioredoxin-like domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCTCC GTACCACGTC CCCGGCACTG TCCTACCGTT GCGCGCCCTG GACGCACGCG 
CAGGGCGTCG ACCCGGTCGG GTCCGCCCTG ACCTGCGACA CGCTCGTGCT CATCGAGGTG
CCACCGCCCT GGCCCCGGGA CGTCGGCGAG ATACCGGCCT TCGCCGACCT CCAGCGGCGT
GACCTTCGCC GAACCAGGGT GCTGGCGGTC AGACCCCCTG CGGACGATTT CAACGACCCG
ATCGGGAAGG CCGCCGTCCC GGTGGGCTCC GCGGTCGCCA GCGATCAGCC GGGACCAAGC
GTCGGGTGCG GTGTGCGGGT GACGATCTGG CGCCGGGTGG ACTCCGGCCG TTTCGTGGGT
ACCGACCACC TCGTGCCCGC CGAAGGCATC GCCGACGAGG TCGCCCGGCT GCTCGAGGCG
CCGCAGGCGG ACCCGACGAG CCGGACCGCA CCCGCCGAGG TGCTGCTGTG TGGGCACGGC
GCGCGGGACC GCTGCTGCGC GCGCCTGGGG ACTCGCCTGG CACTGGACGT GGCCGCGGCC
TGGCCAGGTG TCCGCGTCCG CCGGTGCAGC CACACCGGCG GTCACCGCTT CGCTCCGACC
GGGTTCACGC TGCCGGACGG GCGGGCCTGG GGGTTCCTCG ACGTCGAGAG CCTCGAGGTG
ATCATGCGTC GATCCGGGCC GCCGCCGCTG CGGGGCCACT ACCGCGGTAA CACCGCGCTG
GACGCCTGGG GACAGGTGGC CGAACGGGAG CTGTTCGAGC GGTTCGGCTG GGCCTGGCTG
GACCACCAGC TCACCTCCTC TCGCACCGAG ATCGCGGCCG GGGGACGGTC GGCAACCGTG
GAGCTGGCCT GGGGCGGACC GACCGGTCCC GCTACGGCGA CTGCGAGGAT CGACGTCATC
CGCGACGTTC CCGTCCTCGT CTGCGGCGAG CCTCCCGAGC GGGCCGAGAA GACGGCACCG
GAACTGGCAC TACGCTCCAT CAACCTCGCC GGCAGAGGCT GA
 
Protein sequence
MSLRTTSPAL SYRCAPWTHA QGVDPVGSAL TCDTLVLIEV PPPWPRDVGE IPAFADLQRR 
DLRRTRVLAV RPPADDFNDP IGKAAVPVGS AVASDQPGPS VGCGVRVTIW RRVDSGRFVG
TDHLVPAEGI ADEVARLLEA PQADPTSRTA PAEVLLCGHG ARDRCCARLG TRLALDVAAA
WPGVRVRRCS HTGGHRFAPT GFTLPDGRAW GFLDVESLEV IMRRSGPPPL RGHYRGNTAL
DAWGQVAERE LFERFGWAWL DHQLTSSRTE IAAGGRSATV ELAWGGPTGP ATATARIDVI
RDVPVLVCGE PPERAEKTAP ELALRSINLA GRG