Gene Francci3_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0043 
Symbol 
ID3903522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp51706 
End bp53304 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content71% 
IMG OID637877373 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_479166 
Protein GI86738766 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.162495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATTACG AGCCTGACCG ACGATCGGTG TTACGGACCG CCGCAGCCGG AGCCGGCGGG 
CTCGGGCTCG CCGCGCTCGG GCTCGGTGGC GGCCTGATGA GCGCCTGTCA GCAGGAGGAC
GCCCCCCACC CAATACCGAC GGAGATCGAC TGGCGCACGC TCGACGAACG ACTGACCGGT
CCGCTGCTGC GACCGACCGA TCCGGACTAC GCCGTCGCCA GCATGCTGTT CGACCCGGCT
TTCGACGCGG TACGTCCGCA GGCGGTGGTC CGTGCCATGT CCGCGGGGGA CGTCACGGCC
TGCATCGACT TCGCCCGCTC CACCGGGATA CATCTCGTCG CGCGGGCCGG TGGCCACAGC
TACGGGGGTT ACTCAACGAC GACCGGGCTG GTCGTGGACG TGACTGCGAT GGCGTCGGTC
CGGCCGGGGC CCGACGGCAC CGCGCTGATC GGGGCCGGAG CGCTGCTGAT CGACGTCTAC
TCCGCACTCG CCGAGAACGG GCTGGCGCTG CCAGCCGGAT CATGCCCGAC CGTCGGCATC
GCCGGGCTCG CCCTCGGCGG CGGCATCGGA GTGTTGAGCC GGCGCTACGG TCTGACCTGC
GATCGGATGG TCTCCGCCGA GGTCGTGCTG GCCTCCGGGG AGACCGTGCG CACCGACGCC
GACACCGAGC CGGACCTCTT CTGGTCGCTA CGCGGTGCGG GCGGGGGCAA CGTCGGCATC
GTGACGTCGT TCACCTTCGC CACCCATCGG GCGACACCGT TGGCGCTGTT CACCTACCGC
TGGCCCTGGG ACGTGGCGGC GGACGTGCTC ACCGCGTGGC AGGGCTGGAT CGCCGACAGC
GGCGGCGCGC CGGAGGACCT GTGGTCGACC TGCGTCGTGA CCTCGATGCC GACGACCGGG
GCCACCGGCA GCCCAGCCCT GCGGGTGAGC GGCGTGCTCG CGGGCGGCGC CGACGACACG
CGGATCACGT GGCTGCGGGA TCGACTCGCC GACCTCGTCG CCGCCGTCGG CCGGAGGCCC
TCGAGCACCT TCGTCGCCCA GCGTGGCCAT CTCGAGACGA TGTTGCTTGA GGCGGGCTGC
GCGGGCAAAA GCGTCGACGC GTGCCACCTG CGCGACCGGA CGCCCGGAGG CACCCTGCCA
CGGGTGGCCC AACGGGCTGC CTCGGCATTC CTCACCGAAC CGATGCCTGC CGGCGGGATC
GAGACGATGC TCGCCGCGCT CGAGCGACGC CAGCGCACAC CCGGGGCCGG TCCGGGTGGC
GTGATCCTGG ATTCCTGGGG CGGCGCGATC AACCGGGTCG GCCCGGGTGA CACGGCGTTC
GTGCACCGCA ACACGCTCGC CAGCGCCCAG TTCGTCGCCG GCTACTCCGT CGACGCATCC
CCTGCGGACA AGGAGGCTAA CCAGAGCTGG TTGCGATCCA CGGTGGCGGC GACGGCGCCG
TTCATGTCGT CGTCGGCGTA CCAGAACTAC ATCGACCCGG ACCTCACCAC CTGGGCCGAT
GCGTACTACG GGGCGAACCT TCCGCGGTTG CGCCAAGTCA AGCGCGCCTA CGACCCGGAC
AACCTGTTTC GCTTCGCGCA GAGCATCGCG CCGTCCTGA
 
Protein sequence
MDYEPDRRSV LRTAAAGAGG LGLAALGLGG GLMSACQQED APHPIPTEID WRTLDERLTG 
PLLRPTDPDY AVASMLFDPA FDAVRPQAVV RAMSAGDVTA CIDFARSTGI HLVARAGGHS
YGGYSTTTGL VVDVTAMASV RPGPDGTALI GAGALLIDVY SALAENGLAL PAGSCPTVGI
AGLALGGGIG VLSRRYGLTC DRMVSAEVVL ASGETVRTDA DTEPDLFWSL RGAGGGNVGI
VTSFTFATHR ATPLALFTYR WPWDVAADVL TAWQGWIADS GGAPEDLWST CVVTSMPTTG
ATGSPALRVS GVLAGGADDT RITWLRDRLA DLVAAVGRRP SSTFVAQRGH LETMLLEAGC
AGKSVDACHL RDRTPGGTLP RVAQRAASAF LTEPMPAGGI ETMLAALERR QRTPGAGPGG
VILDSWGGAI NRVGPGDTAF VHRNTLASAQ FVAGYSVDAS PADKEANQSW LRSTVAATAP
FMSSSAYQNY IDPDLTTWAD AYYGANLPRL RQVKRAYDPD NLFRFAQSIA PS