Gene Sala_2129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2129 
Symbol 
ID4080104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2233902 
End bp2235332 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content68% 
IMG OID638010505 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_617171 
Protein GI103487610 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGGC CCCGCACGCT TTACGAAAAA ATCTGGGACG CGCATGTCGT CGAACGGCGC 
GGCGACGGCA CCTGCCTGAT CTTCATCGAC CGCCACCTCG TCCATGAAGT CACCAGCCCG
CAGGCCTTTG CCGGGCTTCG CGCGAGCGGC CGCACGGTGC GCCGCCCCGA CCTGACGCTC
GCGGTGCCCG ACCATAATGT GCCGACGACG CCGCGCAAGG ATGCGGCGGG CAACCGGCTG
CCGATCGCCG ACCCCGAAAG CGCGGCGCAA TTGGCGGCGC TGGAGAAGAA CGCGCCCGAG
TTCGGCATCC GTTACATCGA CGCGATCGCG CCCGAGCAGG GCATCGTTCA TGTCGTCGGC
CCCGAACAGG GTTTTTCGCT GCCCGGCGCA ACGATCGTTT GCGGCGACAG CCACACGGCG
TGCCACGGCG GCATCGGCGC GCTCGCCTTT GGCATCGGGA CGAGCGAGGT CGAGCATGTT
CTGGCGACGC AGACCCTGCT GCTCCAGCCC GCGAAGACGA TGGAGGTGCG CGTCGAGGGC
GACGTCGGCC CCGGCGTCAG CGCGAAGGAC ATCATCCTGC ACATCACGGG CACCATCGGC
GCCGCGGGCG GCACGGGCCA CGTCATCGAA TATACCGGCA GCGCGATCCG CGCGCTGTCG
ATCGAGGGGC GGCTGACGAT CAGCAACATG GCGATCGAGG GCGGCGCGCG CGCCGGACTG
ATCGCGCCCG ACGAGACGAC TTTCGCTTAT CTCAAGGGCC GCCCCTACGC CCCGAAGGGC
GCCGACTGGG ACGCCGCGGT CGCTTACTGG AAAAGCCTGA CCACCGATCC CGGCGCGACT
TACGACAAGG TGGTCGTCAT CGACGCCGCC GACATCGCGC CCAGCGTGAC GTGGGGCACG
AGTCCCGAGG ACGTCGTGCC GATCACCGGC ACCGTGCCCG ATCCGGCAAG CTTTTCCGAT
CCGTCGAAAC GGGCCGCCGC CGCCAAGAGC CTGGCCTATA TGGGGCTCGA ACCCGGCACG
CGGATGCAGG ATGTGCCGGT CGAAAATATC TTCATCGGCA GCTGCACCAA CAGCCGGATC
GAGGATTTGC GCGCCGCCGC AGCGGTGCTG AAGGGTCGCA GGAAGGCGCC GGGCGTCAAA
TGGGCGATCG TCGTTCCCGG TTCGGGCCTG GTGAAGGCTC AGGCCGAGGC CGAGGGACTC
GACCGCATCT TCATCGACGC GGGACTCGAA TGGCGCGAGC CCGGCTGTTC GGCGTGCCTC
GCGATGAACC CCGACAAGGT GCCCGCGGGC GAGCGCTGCG CATCGACGAG CAATCGCAAT
TTCGTCGGTC GGCAAGGACC GGGTGCCCGC ACGCACCTCG TCAGCCCCGC GATGGCCGCG
GCGGCGGCGG TGACCGGAAA GCTCACCGAC GTGCGCGAAT TAATGGCATG A
 
Protein sequence
MTRPRTLYEK IWDAHVVERR GDGTCLIFID RHLVHEVTSP QAFAGLRASG RTVRRPDLTL 
AVPDHNVPTT PRKDAAGNRL PIADPESAAQ LAALEKNAPE FGIRYIDAIA PEQGIVHVVG
PEQGFSLPGA TIVCGDSHTA CHGGIGALAF GIGTSEVEHV LATQTLLLQP AKTMEVRVEG
DVGPGVSAKD IILHITGTIG AAGGTGHVIE YTGSAIRALS IEGRLTISNM AIEGGARAGL
IAPDETTFAY LKGRPYAPKG ADWDAAVAYW KSLTTDPGAT YDKVVVIDAA DIAPSVTWGT
SPEDVVPITG TVPDPASFSD PSKRAAAAKS LAYMGLEPGT RMQDVPVENI FIGSCTNSRI
EDLRAAAAVL KGRRKAPGVK WAIVVPGSGL VKAQAEAEGL DRIFIDAGLE WREPGCSACL
AMNPDKVPAG ERCASTSNRN FVGRQGPGAR THLVSPAMAA AAAVTGKLTD VRELMA