Gene GSU1798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1798 
Symbol 
ID2685639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1963664 
End bp1965250 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content55% 
IMG OID637126485 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionNP_952848 
Protein GI39996897 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0044129 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCTTG TAAAACTCTA CGATACGACT CTCCGCGACG GAACCCAGGC GGAGGATATT 
TCCTTCCTCG TGGAGGACAA AATCCGGATC GCCCATAAAC TCGATGAGAT AGGCATTCAC
TACATCGAGG GTGGGTGGCC CGGCAGCAAT CCGAAGGATG TTGCCTTTTT CAAGGACATC
AAGAAAGAGA AACTCTCCCA GGCGAAGATC GCTGCGTTCG GTTCCACCCG TCGTGCCAAG
GTAACCCCCG ACAAGGACCA TAACCTCAAG ACCCTCATTC AGGCGGAACC AGATGTCTGC
ACCATATTCG GCAAGACATG GGATTTTCAC GTGCATGAGG CACTGCGGAT ATCGCTCGAG
GAAAATCTTG AGTTGATTTT CGACTCGCTG GAATACCTTA AGGCGAATGT TCCCGAGGTC
TTCTACGACG CCGAGCACTT TTTCGACGGC TACAAGGCGA ACCCGGACTA CGCCATCAAG
ACCCTCAAGG CCGCTCAGGA CGCAAAGGCC GACTGCATTG TTCTCTGCGA CACCAACGGC
GGTACCATGC CCTTCGAGCT TGTCGAGATT ATCCGCGAGG TGCGCAAGCA CATCACGGCC
CCTCTCGGCA TCCACACGCA CAACGATTCA GAGTGCGCCG TTGCCAACTC CCTCCATGCG
GTCAGCGAAG GAATTGTCCA AGTCCAGGGT ACCATCAACG GTTTCGGCGA GCGCTGTGGC
AATGCCAACC TCTGCTCGAT CATCCCTGCC CTGAAACTCA AGATGAAGCG CGAGTGCATT
GGGGACGACC AGCTTAGAAA ACTTCGCGAT CTTTCACGAT TCGTCTACGA ATTGGCCAAC
CTTTCGCCCA ACAAGCATCA GGCATATGTG GGCAACTCGG CATTTGCCCA CAAGGGCGGC
GTCCACGTAT CGGCCATCCA GCGCCATCCT GAAACCTATG AGCACCTGCG GCCGGAGCTG
GTCGGCAACA TGACGCGGGT ACTCGTTTCC GACCTGTCTG GTCGCTCCAA TATCCTTGCT
AAGGCAGAAG AGTTCAATAT CAAGATGGAT AGCAAGGACC CCGTTACGCT CGAAATTCTC
GAAAATATAA AGGAAATGGA GAACCGGGGT TACCAGTTCG AAGGGGCGGA AGCGTCGTTC
GAGCTCCTCA TGAAAAGAGC CCTCGGCACC CACCGCAAGT TTTTCTCGGT GATCGGTTTC
CGGGTAATCG ACGAAAAGCG CCATGAGGAC CAGAAGCCTC TTTCTGAAGC TACAATCATG
GTCAAGGTGG GGGGCAAAAT CGAGCACACC GCAGCTGAGG GAAATGGTCC GGTGAATGCA
TTGGACAATG CTCTCCGCAA AGCCTTGGAG AAGTTTTATC CGCGCCTCAA GGAAGTAAAG
CTGCTGGACT ACAAGGTGCG CGTATTGCCG GCGGGGCAGG GGACGGCCTC GTCTATCAGG
GTGCTCATCG AGTCTGGCGA TAAAGAGAGC CGCTGGGGTA CGGTCGGTGT TTCGGAAAAC
ATTGTCGATG CATCCTACCA GGCTCTTCTG GACAGTGTGG AGTACAAGCT CCACAAAAGC
GAAGAGATCG AAGGCTCCAA GAAGTGA
 
Protein sequence
MSLVKLYDTT LRDGTQAEDI SFLVEDKIRI AHKLDEIGIH YIEGGWPGSN PKDVAFFKDI 
KKEKLSQAKI AAFGSTRRAK VTPDKDHNLK TLIQAEPDVC TIFGKTWDFH VHEALRISLE
ENLELIFDSL EYLKANVPEV FYDAEHFFDG YKANPDYAIK TLKAAQDAKA DCIVLCDTNG
GTMPFELVEI IREVRKHITA PLGIHTHNDS ECAVANSLHA VSEGIVQVQG TINGFGERCG
NANLCSIIPA LKLKMKRECI GDDQLRKLRD LSRFVYELAN LSPNKHQAYV GNSAFAHKGG
VHVSAIQRHP ETYEHLRPEL VGNMTRVLVS DLSGRSNILA KAEEFNIKMD SKDPVTLEIL
ENIKEMENRG YQFEGAEASF ELLMKRALGT HRKFFSVIGF RVIDEKRHED QKPLSEATIM
VKVGGKIEHT AAEGNGPVNA LDNALRKALE KFYPRLKEVK LLDYKVRVLP AGQGTASSIR
VLIESGDKES RWGTVGVSEN IVDASYQALL DSVEYKLHKS EEIEGSKK