Gene Francci3_2495 details


Gene Information

Locus tag: Francci3_2495
Symbol:
ID: 3904873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2944039
End bp: 2945838
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 68%
IMG OID: 637879825
Product: thiamine pyrophosphate protein
Protein accession: YP_481591
Protein GI: 86741191
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
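
The coordinate and length fields in this record agree with one another if the coordinates are read as 1-based and inclusive, and if the coding sequence ends in a stop codon that is not counted in the protein length. Both are assumptions about the database's conventions, not statements taken from it. A minimal Python sketch of that arithmetic follows; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(2944039, 2945838)
print(length_bp)                  # 1800
print(protein_length(length_bp))  # 599
```

The two results match the Gene Length and Protein Length fields listed in the record.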


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.215575
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.718329
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACAGG TCGCGGACTA TGTCCTGCAG CGGTTGACCG GCTGGGGTGT TCACCGCATA 
TTCGGCTATC CCGGGGACGG CATCAACGGC TTCCTCGGGG CTTTCGACCG GGCCGGCGGT
GATCCTGGGT TCCTGCAGAC CCGGCACGAG GAGATGGCGG CGTTCATGGC CTGCGCGCAC
GCCAAGTTCA CCGGTGAGGT AGGCGCGTGT GTCGCGACCT CCGGCCCGGG GGCGATCCAT
CTGCTCAACG GTCTCTACGA CGCCCGGCTC GACCATCAGC CGGTGCTGGC GATCGTCGGC
CAGCAGCGGC GGACCTCCCT CGGCGCCCAC TACCAGCAGG AAATCGATCT GATCTCGCTG
TTCAAGGACG TCACCGAGTA CGTGCAGTAC TGCATGGCCC CGCCCAGCGC CCGTCATCTC
GTCGATCGGG CGATGAAGAC AGCGCTGTCC AGCCGGGCGC CGGTCTGCCT GATCTTCCCG
GAGGATGTCC AGGAGGAGAA GTACACCGAG CCTCCGCATG AGCACGGGGC GGTGCGGACC
AGCATTGGCT GGACGAAACC ACGCATCCTG CCGGATCCGG ACGAGGTGCG CCGCGCCGCC
GCCGTGCTGA ATCGGGGCCG CCGGGTGGCG ATGTTGATCG GGCAAGGGGC GGCCGGCGCC
CGCGAGGAGG TGACGGAGGC CGCCGACCTG CTCGGCGCGG GTGTCGCCAA GGCCCTGCTC
GGCAAGGATG CGCTCCCCGA TACCCTGCCG TTCGTGACCG GCCCCATCGG GCTGCTCGGC
AGCGAAGCCA GCCACAAGAT GGTGATGGGC GCCGACACGC TCCTCCTGGT CGGGACGAGC
TTCCCCTACT CGGAGTGGCT GCCGCGGGAG GGCCAGGCCG CCGGCGTCCA GATCGATATC
GACGGCCGGA TGATCGGTAT TCGTTATCCG ATGGATGTTC ATCTCGTCGG CGACGCGGCC
GAGACGCTGC GCCAGTTGAT TCCCCTGCTC ATCCGCAAGG AGGACCGTTC CTGGCGGCGG
TTCATCGAAC GGGAGGTGGC GACCTGGCAG CGGGTGCTGG CGGACCGGGC CAGGCTGCGG
GCGGATCCGA TGAACCCGCA GATCGTCGCG TACGAGTTGG ACAAGCGACT GCCCGATAAC
GCGATCCTCA CCGCGGACTC CGGGTCGGCG ACCACCTGGT GGGCCCGTTA CCTGCGCATC
CGTGGGGACA TGAAGGCGTC GTTGTCGGGA ACCCTGGCGA CCATGCTGCC CGGAGTGCCC
TACGCGGTGG CGGCGAAGCT GGCCTATCCG GAACGACCGG TGATCGCGTT CGTCGGCGAC
GGGGCGTTCT CGATGCTGGG CATGAACGAG CTGTTCACGG TCAAGCGGTA CTGGGAGAGG
ATGAACACCG ATCCGCGGCT GGTGTTCACC GTGTTCGTGA ACGAGGACCT CAACCAGGTC
TCCTACGAGC AGCGGGTGAT GGCGGGTGAT CGGATCAATG TCGAGACGCA GAAGATCCCG
TATGTGCCGG CGGCGGATTT CGCCCGGCTC CTCGGTTTCA CCGGGATCCG CTGCGACTCG
CCGGACAAGA TCGGTGCCGC GTGGGAACAG GCGCTGGCTG CGGACCGGCC GGTTGTGCTT
GAGGTCGTTG TCGACGCGAA GGTGCCGCCG CTGCCGCCGC ACGTCCGGCC CGAGCAGATG
CGCAAGACCG CCCGGGCGTT CCTGCAGGGC GACCCAGAGG CCGTCGGCAT CGCCGTGCAG
GGCTTCAAGG GCAAGTGGCA GGAGGCGAGG GAGCACCTCC CGCACGCCGC CCGCAGGTAG
 
Protein sequence
MSQVADYVLQ RLTGWGVHRI FGYPGDGING FLGAFDRAGG DPGFLQTRHE EMAAFMACAH 
AKFTGEVGAC VATSGPGAIH LLNGLYDARL DHQPVLAIVG QQRRTSLGAH YQQEIDLISL
FKDVTEYVQY CMAPPSARHL VDRAMKTALS SRAPVCLIFP EDVQEEKYTE PPHEHGAVRT
SIGWTKPRIL PDPDEVRRAA AVLNRGRRVA MLIGQGAAGA REEVTEAADL LGAGVAKALL
GKDALPDTLP FVTGPIGLLG SEASHKMVMG ADTLLLVGTS FPYSEWLPRE GQAAGVQIDI
DGRMIGIRYP MDVHLVGDAA ETLRQLIPLL IRKEDRSWRR FIEREVATWQ RVLADRARLR
ADPMNPQIVA YELDKRLPDN AILTADSGSA TTWWARYLRI RGDMKASLSG TLATMLPGVP
YAVAAKLAYP ERPVIAFVGD GAFSMLGMNE LFTVKRYWER MNTDPRLVFT VFVNEDLNQV
SYEQRVMAGD RINVETQKIP YVPAADFARL LGFTGIRCDS PDKIGAAWEQ ALAADRPVVL
EVVVDAKVPP LPPHVRPEQM RKTARAFLQG DPEAVGIAVQ GFKGKWQEAR EHLPHAARR
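
Both sequences are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The sketch below, in plain Python with hypothetical helper names, shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. It is demonstrated on the first line of the gene sequence only, so it does not reproduce the record's 68% figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGTCACAGG TCGCGGACTA TGTCCTGCAG CGGTTGACCG GCTGGGGTGT TCACCGCATA"
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would give the value to compare against the GC content field.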