Gene Franean1_4160 details

Gene Information

Locus tag: Franean1_4160
Symbol:
ID: 5672515
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4941642
End bp: 4943297
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 77%
IMG OID: 641243033
Product: cobyrinic acid a,c-diamide synthase
Protein accession: YP_001508450
Protein GI: 158315942
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1797] Cobyrinic acid a,c-diamide synthase
TIGRFAM ID: [TIGR00379] cobyrinic acid a,c-diamide synthase
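
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the gene sequence ends in TGA). The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4941642
END_BP = 4943297


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1656
print(protein_length(length_bp))  # 551
```

Both results match the Gene Length (1656 bp) and Protein Length (551 aa) fields.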


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0028663
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.287821
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAGCG CCGGGCTGAT CCCCCGGCTG GTGCTCGCCG CCCCGGCGTC CGGCGCGGGG 
AAGACGACGG TGGCGACCGG GCTGATGGCC GCGCTGACCG CCCGCGGCCT GCGGGTGTCG
GGTCACAAGG TGGGCCCGGA CTACATCGAC CCGGGTTATC ACGCGCTGGC GACCGGCCGG
CCCGGCCGCA ATCTGGACGC GGTGCTGTGC GGGCCGGATC TGATCGGCCC GCTGTTCGCC
CACGGGGCGG CCGGCGCGCA GCTGGCGGTC GTCGAAGGGG TGATGGGCCT GTTCGACGGG
GTCGCCGCCC CTGCTGCCGG GCAGGAGGCT GATCACGGGT CGACCGCGCA TGTCGCCCGG
CTCCTCGACG CGCCGGTGGT GCTGGTCGTC GACGCCGCCG GGGCCGGGCG GTCGGTGGCC
GCGCTGGTCT GCGGGTTCGC CGCGTTCGAC CGGCGGGTGC GCCTCGCCGG GGTGATCCTC
AACCGGGTCG GGTCGGACCG GCACCGACAG ATCCTCACCG GTGCGCTGGC CGGGATCGGG
GTGGCGGTGC TGGGCGCGGT GCCCCGCGAC GGCGCGGTGC ACACCCCGTC GCGGCATCTG
GGGCTGGTGC CGGCGGCGGA ACGGGCGGTG GCGGCCGCGC AGGCGGTGCG GCGTCTCGGT
GTGCTGGTCG GGGCGGCGGT CGACCTGGAC GCGCTGATCC GCCTCGCCTC CTCGGCGCCA
CCCCTGCCGG TCGACCCGTG GGATCCCGCC CGGCAGATCG CGCAGGCCAC CGCCACCGGT
GCTGCGGGTC GGGTCGGGCA GCACACCGTC AACGCGGGCG CGGGGGGTGT GGTGCCCGGG
GGGCGGCCGG TGCGGATCGC GGTGGCCGGC GGGGCGGCGT TCACCTTCGG CTACACCGAG
CACGTCGAGC TCCTCACCGC CGCTGGCGCG CAGGTGCTCA CCGTCGACCC GCTGCGCGAC
GAGACCCTCC CGGACGGCAC GGACGCGCTG GTCGTCGGTG GCGGGTTCCC CGAGGAGCAT
GCCGGCGCGC TGGCGGCGAA CAGCCGGCTG CGTGGGCAGG TCGCGGCGCT GGCGGCCCGC
GGCGCGCCGC TGGTCGCCGA ATGCGCCGGG CTGCTCTACC TGGGCCGTTC GTTGGACGGG
ACGGCGATGT GTGGGGTTCT CGACACCGAC GCGGTGATGG GCCCGCGGCT CACCCTGGGC
TACCGGCATG CGGTCGCCGC GGCTGACAGC CCGCTGGTGG CGGCGGGGAC GGTCGTCACC
GCCCACGAGT TCCACCGCAC CCGGCTGAGC GTCGACCGGG CGGAGCTGCC CGGGACACCG
GCCTGGCAGG TGGACATCCC ACCGCCGCGG TTCGGTGACA GCAACCCCGC CGCCGGTGCC
GCCGGTGCCG GTGGGCGGGT GGACGGCGGA GTGGACGGTA CGGCGGCGGG TGGGCGGCCT
GAGGGGTTCG TCCGCGGCGG GGTGCACGCC TCCTACCTGC ACCTGCACTG GGCGGGCCTA
CCCGCCGTGC CGGCGCGGCT CGTCGCCGCC GCCGGTACCG CCCGCACCCG CCCGTCCGGC
GGCACCCCAC CCGCCGGCGG CGGGCATGGG CATGGACGGG ACGCCGGTGG CAATCCCCCG
GCGTCCCCTG GAACAACGGA GGTGTCATCG CGGTGA
 
Protein sequence
MVSAGLIPRL VLAAPASGAG KTTVATGLMA ALTARGLRVS GHKVGPDYID PGYHALATGR 
PGRNLDAVLC GPDLIGPLFA HGAAGAQLAV VEGVMGLFDG VAAPAAGQEA DHGSTAHVAR
LLDAPVVLVV DAAGAGRSVA ALVCGFAAFD RRVRLAGVIL NRVGSDRHRQ ILTGALAGIG
VAVLGAVPRD GAVHTPSRHL GLVPAAERAV AAAQAVRRLG VLVGAAVDLD ALIRLASSAP
PLPVDPWDPA RQIAQATATG AAGRVGQHTV NAGAGGVVPG GRPVRIAVAG GAAFTFGYTE
HVELLTAAGA QVLTVDPLRD ETLPDGTDAL VVGGGFPEEH AGALAANSRL RGQVAALAAR
GAPLVAECAG LLYLGRSLDG TAMCGVLDTD AVMGPRLTLG YRHAVAAADS PLVAAGTVVT
AHEFHRTRLS VDRAELPGTP AWQVDIPPPR FGDSNPAAGA AGAGGRVDGG VDGTAAGGRP
EGFVRGGVHA SYLHLHWAGL PAVPARLVAA AGTARTRPSG GTPPAGGGHG HGRDAGGNPP
ASPGTTEVSS R
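
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as the whitespace-separated 10-base blocks shown above. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the source database itself computes or rounds GC content is not stated here.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used only as a short demo.
demo = clean_sequence("ATGGTGAGCG CCGGGCTGAT")
print(len(demo))          # 20
print(gc_fraction(demo))  # 0.65
```

Passing the full gene sequence through `clean_sequence` should give a string of 1656 bases, which can then be compared against the reported GC content.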