Gene Francci3_1561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1561 
Symbol 
ID3904793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1872352 
End bp1873689 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content71% 
IMG OID637878898 
Product4-hydroxybutyrate coenzyme A transferase 
Protein accessionYP_480666 
Protein GI86740266 
COG category[C] Energy production and conversion 
COG ID[COG0427] Acetyl-CoA hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.788962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATCG TCAGCGAGGC GGAGTTCGGT CGCCGCATCG AGACCCATCT CGCCGCGGGT 
TCCGGTGTGG GCGGTGGAGC CGGTCCTCGT GGAGCCGGAG CCGGTACGGG GGTAGCCGGT
CCTGGCCGAT GGCGGTCGGC CACGCCGCGG GTCGTGGCGG CGGGGAACTT CGCCACCCCG
CTCGTCGCGT TGCGTGTGAT CGACGCGGTG GTGAGTGAAT ACCGCCTCTT TATGATCAAC
GCTCAGGGTG GCGTGCCGGA ACGCGGCGGC GTGACGCCGG AGACCGCGTT CGTCGGCCCG
GCGATGCGCA ACGTGCCCGG CCTCGACTAC CTGCCCAACC GGCTCAGCCT CGTGCCTCGC
CTGCTCGCGA CCACCCACCG GCCCGACGTC GTCGTGTTGC ACACCAGCGT GCCCGACGGG
GGGAAGGTCT CGCTCGGCAC CGAAGTCAAC ATCCTGCCGG CCGCGGTCGA GGCGGCCCGC
GCGCACGGGG GACTGGTGGT CGCCCAGCTC AACCCGGCGA TGCCCTACAC CTTCGGCGAC
GGTGAACTGA GCATCGATGA CGTCGACCTC GCCGTGGAGG TGGAGCAGCC GCTCGCCAGC
CCCGCGGTCA CGCCCGTCGA CGACGTCCGC GGGCAGATCG GCGAGCGGGT CGCCGCGCTC
GTCGAGGACG GGGCGACCCT GCAGCTCGGC ATCGGCGGTG TCCCGAACGC CACGCTGTCG
GCCCTCGTCG ACCGGCGGGA TCTGCGCGTG TGGACCGAGA CCTTCTCCGA TGGCATGCTC
GCGCTCGAAG CCTCCGGCGC GCTGGCCGCC GGGACACCGC TGCGGACCTC GTTCCTGTTC
GGGTCGGCCG AGCTCTACTC GTGGGCGCAC CGCAATCCGC GGCTGCTGCT GGTGCGCACC
GAAATCGTGA ACGACCCGGG GGTCATCGCC CGGCAGCCGC GGATGACCTC GATCAACACC
GCGCTGCAGG TGGATCTGTA TGCGCAGGCG AACGCGTCCT GGATCCGCAA CCGCATCTAC
TCCGGCTTCG GTGGGCAGTC CGACTTCGTC GTCGGCGCGC TGCACGCGGC CGACGGCAAG
GCGATCATCG CCCTGCCGAG CCGGCATGCC CGGTCGGGGG ATTCCTGTGT GCTGCCGCGG
CTCACCAGCC CGGTCACCAG TTTCCAGCAC AGCTACGTCG TGTCCGAGAA CGGGACCGCG
GCCGTGTGGG GGCGCGGCCA GCACGAACAG ACCGCCCGAC TCATCGACCA CGTCGCCCAC
CCCGACGCCC GAGCCGGCCT GACGGAGGCG GCCGGGTCGC TGGGACTGCT CGCCGGCCGC
GCATCGTCAA CCGTCTGA
 
Protein sequence
MDIVSEAEFG RRIETHLAAG SGVGGGAGPR GAGAGTGVAG PGRWRSATPR VVAAGNFATP 
LVALRVIDAV VSEYRLFMIN AQGGVPERGG VTPETAFVGP AMRNVPGLDY LPNRLSLVPR
LLATTHRPDV VVLHTSVPDG GKVSLGTEVN ILPAAVEAAR AHGGLVVAQL NPAMPYTFGD
GELSIDDVDL AVEVEQPLAS PAVTPVDDVR GQIGERVAAL VEDGATLQLG IGGVPNATLS
ALVDRRDLRV WTETFSDGML ALEASGALAA GTPLRTSFLF GSAELYSWAH RNPRLLLVRT
EIVNDPGVIA RQPRMTSINT ALQVDLYAQA NASWIRNRIY SGFGGQSDFV VGALHAADGK
AIIALPSRHA RSGDSCVLPR LTSPVTSFQH SYVVSENGTA AVWGRGQHEQ TARLIDHVAH
PDARAGLTEA AGSLGLLAGR ASSTV