Gene Francci3_0083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0083 
Symbol 
ID3905127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp101296 
End bp102663 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content65% 
IMG OID637877413 
Productcitrate synthase 
Protein accessionYP_479206 
Protein GI86738806 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01798] citrate synthase I (hexameric type) 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGAGC AGGTCAGCAC ACTGACCGTC ACCGACAACC GCACCGGAAA AACGTATGAG 
ATACCGGTTA CGGACGGCGC GGTACGTGCG AGTGCGTTTC GCTCGATCAA AACGAGCGAC
GACGACTTCG GGCTGATGAC GTACGACCCC GCCTTCACGA ACACGGCGAG CTGTAGGAGT
GCGATCACCT ACATCGACGG TGACGCCGGC ATCCTGCGCT ACCGCGGCTA TCCGATCCAG
GAGCTGGCGG AGAAGAGCAG CTTCCTCGAG GTCGCCTACC TGCTCCTCGC CGGCGAGCTT
CCCTCCGCCG AGGAGCTGTC GACCTGGGAA GACGAGATCA CCCATCACAC CCTGGTCCAT
GAGTCGATCA AGAAGTTCAT CGACGGCTTC CACCACGACG CGCACCCGAT GGGCATGCTG
GTCTCCACGG TAGGTGCGCT GTCGACCTTC TACCCGGACG CCAAGACGAT CGACGACCCG
GGGCTGCGCC GGCTGCAGAT CGTCCGCCTG ATCGCCAAGA TCACGACATT GGCGGCGTTC
TCCTACCGGC ACTCGGTCGG TTTCCCGTAC GTCTACCCGG ACAACGACCT GTCCTACGCC
GGCAACTTCC TCAACATGAT GTGGAAGGCC ACCGAGCTCA AGTACGAGCC GGACCCCAAC
CTGGAGCACG CCCTCGACGT GCTGTTCATC CTGCACGCCG ACCACGAGCA GAACTGCTCG
GCCAACGCGA TGCGCGCGGT CGGCAGCTCG CAGGCGGACC CGTTCTCGGC GGCCGCCGCC
GCGATCTCCG CGCTCTACGG TCCGCTGCAC GGTGGCGCCA ACGAGCAGGT GCTGCGGATG
CTCTCGGAGA TCGGGTCGGT CGAGAACATC CCCGCCTTCA TCGCCCAGGT GAAGGACGGC
AAGAAGAAGC TGATGGGCTT CGGCCACCGG GTCTACAAGA ACTACGACCC GCGGGCCCGG
GTGATCCGTC AGGTCGCCGA CGAGGTCTTC AAGGTCACCG GCACGAACCC GTTCCTCGAC
CTGGCCATGG AGCTGGAGCG GATCGCGCTG GAGGACGAGT ACTTCGTCGC GCGCAAGCTC
TACCCAAACG TCGACTTCTA CACCGGCATC ATCTACCAGG CGATGGGCTT CCCGGTAGAG
ATGTTCCCGG TGCTGTTCGC CATCGGCCGG ATGCCGGGAT GGCTGGCGCA GTGGGAGGAG
GGTCTACTCG ACCCCGAGCA GAAGATCGCC CGACCGCGCC AGCTGTACAT CGGCTACGAC
CAGCGGTCGT ATGTCCCGAT AGACGACCGT AGCGGGGTCG GGGACAGCGG GATCGACCCG
ATCGCGGCCC AGGCGGCCCA GGCGGCACCG GAGCGCCCGC TGCGCTGA
 
Protein sequence
MSEQVSTLTV TDNRTGKTYE IPVTDGAVRA SAFRSIKTSD DDFGLMTYDP AFTNTASCRS 
AITYIDGDAG ILRYRGYPIQ ELAEKSSFLE VAYLLLAGEL PSAEELSTWE DEITHHTLVH
ESIKKFIDGF HHDAHPMGML VSTVGALSTF YPDAKTIDDP GLRRLQIVRL IAKITTLAAF
SYRHSVGFPY VYPDNDLSYA GNFLNMMWKA TELKYEPDPN LEHALDVLFI LHADHEQNCS
ANAMRAVGSS QADPFSAAAA AISALYGPLH GGANEQVLRM LSEIGSVENI PAFIAQVKDG
KKKLMGFGHR VYKNYDPRAR VIRQVADEVF KVTGTNPFLD LAMELERIAL EDEYFVARKL
YPNVDFYTGI IYQAMGFPVE MFPVLFAIGR MPGWLAQWEE GLLDPEQKIA RPRQLYIGYD
QRSYVPIDDR SGVGDSGIDP IAAQAAQAAP ERPLR