Gene Francci3_3628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3628 
Symbol 
ID3904184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4332569 
End bp4334218 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content70% 
IMG OID637880951 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_482709 
Protein GI161353753 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.883953 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.554164 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCGGGG CGGAACGACC CAAGATCCTT GCGGAGGGGC GCCCCGTGCG GCCATACGAT 
CCCGATTCCC TGCACGTCTA TGACACGACA TTGCGGGACG GTACCCAGCA GGAGGGGCTG
TCGCTCTCGG TCGGCGACAA ACTCGCGGTG GCCCGTCACC TGGACGATCT CGGCGTCGGT
TTCATCGAGG GCGGCTGGCC GGGATCGAAC CCGAAGGACG CGGAGTTCTT CACCCGGGCG
CGCGGCGAAC TGAACCTGAC GACCGCCGTG CTGACCGCCT TCGGGGCCAC CCGGCGGGCG
GGCCGCGCCG CCGCGGACGA CCCCCAGGTG GCGGCGCTTC GGGACGCGGG GACCTCGGTG
GTCTGCCTGG TCGCCAAGTC GGATCTGCGC CATGTCGAAC GTGCCCTGCG CACCACCCCG
GCCGAGAACC TCGCGATGAT CCGGGACACC GTCGCGCATC TGCGCGCGGA GGGAAAGCGG
GTGTTCGTTG ACGCCGAGCA CTTCTTCGAC GGCCATCGGG TGTATCCCGA CTACGCTCTT
GAGGTCGTCT CCGTTGCGGC CGAGGCCGGT GCCGAGGTCG TCGTGCTGTG CGACACGAAC
GGTGGCATGC TCCCGACGCG GATCGGCGCG GTCGTCGCCG ATGTCCTCGC CGCAACCGGC
GCCCGGCTCG GTATCCACTG CCACGACGAC GCCGACTGCG CGGTCGCGAA CACCCTGGTG
GCGGTGGAGG CCGGCGTCAC CCACGTGCAG GGCACCGCAA ACGGCTACGG CGAGCGGTGC
GGGAACGCCA ACCTGCTGAG CGTCATCGCG GGCCTGGAGA CCAAGTTGGA CCGGGCCGTG
CTGCCGCCCG GGCGGCTGCG TGAGCTCGTC CGGGTCTCTC ACGCGATCGA CGAGGTCACC
AACTCCGTGC CCGACCCGCA CCGGCCGTAC GTCGGGGCCA GCGCCTTCGC GCACAAGGCC
GGCCTGCACG CCAGTGCCGT CAAGGTCGAT CCGGACCTGT ACCAGCACAT CGACCCGGCG
GTCGTGGGCA GTGACATGAG GATGCTGGTC TCGGAACTCG CCGGTCGGGC GACCATCGAA
CTGAAGGGCC GTGAGCTCGG CCTCGACCTG TCGAACCAGC GTGAGGCGCT CGGACGGGTG
GTGGACCTGG TCAAGGAGCG GGAGGCAGCC GGATACGTCT ACGAGGCCGC GGAGGCCTCC
TTCGAACTCC TGCTGCGCGA CGAGGTGACC GGCCGGCAGC GGTTCTACAC CCTCGAATCC
TGGCGCGTCA TCGTCGAGCA GCGTCCGGGC GGCGAGGTGG CCAGCGAGGC GACGGTGAAG
CTCACCTCGC ACGGCGAACG CCACGTCGCG ACGGCGGAGG GGAACGGACC GGTCAACGCT
CTCGACACGG CACTGCGCAA CGCGCTGGAG AAGGCGTATC CGGGGTTGGC CGATCTGGAC
CTGGTGGACT ACAAGGTTCG GATCCTCGAC GGCAAGCAGG GGACCGGCGC GGTGACCCGG
GTCCTGCTGG GAACCAGCGA CGGCCGCGAG CGCTGGGACA CGATCGGCGT GGACGAGAAC
ATCATCGCAG CCTCCTGGGC GGCCCTGGAG GACGCTGTCG ACTACGGACT GCGGCGGCAG
GGGGAGAGCC CGGACCCGGT CGCGCCCTGA
 
Protein sequence
MAGAERPKIL AEGRPVRPYD PDSLHVYDTT LRDGTQQEGL SLSVGDKLAV ARHLDDLGVG 
FIEGGWPGSN PKDAEFFTRA RGELNLTTAV LTAFGATRRA GRAAADDPQV AALRDAGTSV
VCLVAKSDLR HVERALRTTP AENLAMIRDT VAHLRAEGKR VFVDAEHFFD GHRVYPDYAL
EVVSVAAEAG AEVVVLCDTN GGMLPTRIGA VVADVLAATG ARLGIHCHDD ADCAVANTLV
AVEAGVTHVQ GTANGYGERC GNANLLSVIA GLETKLDRAV LPPGRLRELV RVSHAIDEVT
NSVPDPHRPY VGASAFAHKA GLHASAVKVD PDLYQHIDPA VVGSDMRMLV SELAGRATIE
LKGRELGLDL SNQREALGRV VDLVKEREAA GYVYEAAEAS FELLLRDEVT GRQRFYTLES
WRVIVEQRPG GEVASEATVK LTSHGERHVA TAEGNGPVNA LDTALRNALE KAYPGLADLD
LVDYKVRILD GKQGTGAVTR VLLGTSDGRE RWDTIGVDEN IIAASWAALE DAVDYGLRRQ
GESPDPVAP