Gene Synpcc7942_1898 details

Gene Information

Locus tag: Synpcc7942_1898
Symbol:
ID: 3775261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1970822
End bp: 1972228
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 59%
IMG OID: 637800339
Product: isopropylmalate isomerase large subunit
Protein accession: YP_400915
Protein GI: 81300707
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00170] 3-isopropylmalate dehydratase, large subunit
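
The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are made up here, and it assumes the start/end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 1970822
end_bp = 1972228

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.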


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00568531
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.587637
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCG GTACCCTGTT CGACAAAGTT TGGGACTTGC ACACGGTTGC CACCTTGCCC 
TCGGGCCAGA CGCAGCTGTT CATTGGCCTG CACCTGATCC ACGAAGTCAC CAGCCCGCAA
GCCTTTTCGA TGCTGCGCGA TCGCGGCTTG ACCGTGAAAT TTCCGGGACG AACGGTCGCA
ACGGTTGACC ATATTGTGCC GACGGAAAAT CAGGCGCGTC CTTTTGCAGA CAGCTTGGCC
GAGGAGATGA TTGTCACCTT GGAGCGCAAC TGCCGCGAGA ATGGCATCCG CTTCTACAAC
ATCGGTTCGG GCAGTCAGGG CATCGTCCAT GTGATTGCGC CAGAGCAAGG CTTGACTCAG
CCCGGCATGA CGATCGCTTG CGGTGATAGC CATACCAGTA CTCACGGGGC GTTTGGTGCC
ATCGCTTTTG GGATTGGCAC CAGCCAAGTT CGCGATGTCT TGGCCTCACA AACGCTGGCG
CTGAGTAAGC TGAAAGTTCG CAAGATCGAA GTCAACGGCG AGCTGCAGCC CGGCGTCTAC
GCCAAGGACG TGATTCTGCA CATCATCCGT AAGCTCGGTG TCAAAGGTGG CGTCGGCTAC
GCCTACGAGT TCGCGGGTAG CACTTTCGCG GCGATGTCGA TGGAAGAACG GATGACCGTC
TGCAACATGG CGATCGAGGG CGGGGCGCGT TGCGGCTACG TCAATCCAGA TCAGATCACC
TATGACTATC TGCAAGGTCG TGAGTTTGCG CCCCAAGGCG AAGCCTGGGA TCGGGCGATC
GCTTGGTGGG AGAGCCTGCG CAGCGAAGCC GATGCGGAAT ATGACGATGT TGTTGTCTTT
GATGCGGCGG AGATTGCACC GACCGTGACT TGGGGGATTA CCCCCGGCCA AGGCATCGGG
ATCACGGAGA CCATCCCGAC GCCCGATAGT TTGCTCGATG AAGATCGGGC GGTAGCGGCG
GAAGCCTACA GCTACATGGA TTTGGAGCCT GGTGCACCGC TGCAAGGTAC GAAAGTTGAT
GTCTGTTTCA TCGGTAGTTG CACTAATGGC CGGTTGAGCG ACTTGCGCGA AGCTGCCAAG
GTTGCCCAAG GTCGCAAGGT CGCGGCGGGG ATTAAAGCCT TCGTGGTGCC CGGTTCCGAG
CGCGTCAAAC AGCAAGCCGA AGCCGAAGGC CTCGACCAAA TCTTTACGGC GGCAGGCTTT
GAGTGGCGGC AGGCCGGCTG CTCGATGTGT CTGGCGATGA ACCCGGACAA ACTGGAAGGC
CGCCAAATCA GTGCTTCTTC ATCCAACCGC AACTTCAAGG GGCGCCAAGG CTCAGCTTCG
GGTCGGACGC TTTTGATGAG TCCGGCGATG GTGGCGGCAG CAGCGATCGC GGGCGAAGTG
ACGGACGTTC GTAACTGGCT GAACTAG
 
Protein sequence
MSRGTLFDKV WDLHTVATLP SGQTQLFIGL HLIHEVTSPQ AFSMLRDRGL TVKFPGRTVA 
TVDHIVPTEN QARPFADSLA EEMIVTLERN CRENGIRFYN IGSGSQGIVH VIAPEQGLTQ
PGMTIACGDS HTSTHGAFGA IAFGIGTSQV RDVLASQTLA LSKLKVRKIE VNGELQPGVY
AKDVILHIIR KLGVKGGVGY AYEFAGSTFA AMSMEERMTV CNMAIEGGAR CGYVNPDQIT
YDYLQGREFA PQGEAWDRAI AWWESLRSEA DAEYDDVVVF DAAEIAPTVT WGITPGQGIG
ITETIPTPDS LLDEDRAVAA EAYSYMDLEP GAPLQGTKVD VCFIGSCTNG RLSDLREAAK
VAQGRKVAAG IKAFVVPGSE RVKQQAEAEG LDQIFTAAGF EWRQAGCSMC LAMNPDKLEG
RQISASSSNR NFKGRQGSAS GRTLLMSPAM VAAAAIAGEV TDVRNWLN
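
As a sanity check on how the two sequences relate, the first 60 bases of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons).

```python
# Codon-to-amino-acid mapping shared by the standard code and table 11,
# in the usual NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = dna[i], dna[i + 1], dna[i + 2]
        index = 16 * BASES.index(b1) + 4 * BASES.index(b2) + BASES.index(b3)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)

# First line of the gene sequence from this record (spaces are ignored).
first_60 = "ATGAGCCGCG GTACCCTGTT CGACAAAGTT TGGGACTTGC ACACGGTTGC CACCTTGCCC"
print(translate(first_60))
```

The result can be compared against the first 20 residues of the protein sequence listed above; the same helper applied to the full gene sequence should end in a single `*` for the terminal TAG.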