Gene GSU3099 details

Gene Information

Locus tag: GSU3099
Symbol: hisC
ID: 2688464
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3402517
End bp: 3403569
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 62%
IMG OID: 637127792
Product: histidinol-phosphate aminotransferase
Protein accession: NP_954140
Protein GI: 39998189
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
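
The start, end, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3402517
END_BP = 3403569
GENE_LENGTH_BP = 1053
PROTEIN_LENGTH_AA = 350


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print("span length (bp):", span_length(START_BP, END_BP))
print("expected protein length (aa):", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions the span length matches the listed gene length, and the derived protein length matches the listed 350 aa.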


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTCCCT TTCGCTCAAA TATTGCCGCC ATGGCAGGCT ATGTCCCCGG TTACCAACCT 
CCGGACGTGG CGTCGTGGAT CAAGCTGAAC ACCAATGAGA ACCCCTATCC GCCGTCGCCC
GAGGTGGTGA AGGCAATCCT GGCGGAGTTA GGGGGTGACG GGGCGCTCCT GCGCACCTAT
CCCAGCGCTT CGAGCCAGGT GCTGCGGGAG ACCGTGGGCG AGCTGTTCGG CTTCGATCCC
GCGTGGATCA TCATGGCCAA CGGCTCTGAC GAGGTCCTCA ATAACCTGAT ACGGGCCTTT
GCCGGCGAGG GGGAGGAAAT TGGCTACGTG CATCCCTCCT ACTCCTACTA CGCGACCCTG
GCCGAAATCC AGGGGGCACG GGTGCGGACT TTCGGCCTTA CGGATGACCT TCGCATTGCC
GGTTTTCCCG GCCGCTACGA GGGAAAGCTC TTCTTCCTGA CCACCCCGAA TTCGCCGCTG
GGCTTCGCTT TTCCCCTTGC CTACATCGAG GAACTGGCAA CCCGCTGTGC CGGGGTCCTG
GTGGTTGACG AGGCCTATGC CGATTTCGCC GACGGTGATG CCTTGGATCT GGTGCGGCGA
CACGAGAACG TGGTCGTGAC CCGTACCCTG TCCAAGAGCT ATTCCCTGGC CGGGATGCGG
CTTGGTTTCG CCGTGGCCCG TCCGGCGGTG ATTGCGGCCC TGGACAAGAT CCGCGATCAC
TATAACCTTG ACCGTCTCGC CCAGGCCGCC TGCGTGGCCT CCCTGCGGGA TCAGACATAC
TTTGCCGGGT GTACCCGCCT GATCCGCGAG ACCCGCGAGT GGTTTTCCGC TGAAATCCGG
ACGCTCGGCT ACGAGGTGAT CCCCTCCCAG GGGAACTTCG TGTTTGCCGC GCCGCCGGAC
CGTGACGGTA AACGGGTCTA CGACGGCCTC TACTCCCGAA AGATCCTGGT TCGTCATTTC
TCCGACCCGC TCCTGGCCCA TGGCATGAGG ATTTCCATCG GCACGCGGGA GGAGATGGAG
GCGACTCTCG CCGCCCTGAA AGAGATTGGC TAA
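
The GC content reported for this gene is the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows; the helper name `gc_percent` is hypothetical, and only the first line of the sequence is used here for brevity, so its result will differ from the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace (such as the 10-base block spacing used in this record)
    and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, copied from the record.
first_line = "ATGCTTCCCT TTCGCTCAAA TATTGCCGCC ATGGCAGGCT ATGTCCCCGG TTACCAACCT"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```

To compare against the reported figure, the full gene sequence would be passed in place of `first_line`.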
 
Protein sequence
MLPFRSNIAA MAGYVPGYQP PDVASWIKLN TNENPYPPSP EVVKAILAEL GGDGALLRTY 
PSASSQVLRE TVGELFGFDP AWIIMANGSD EVLNNLIRAF AGEGEEIGYV HPSYSYYATL
AEIQGARVRT FGLTDDLRIA GFPGRYEGKL FFLTTPNSPL GFAFPLAYIE ELATRCAGVL
VVDEAYADFA DGDALDLVRR HENVVVTRTL SKSYSLAGMR LGFAVARPAV IAALDKIRDH
YNLDRLAQAA CVASLRDQTY FAGCTRLIRE TREWFSAEIR TLGYEVIPSQ GNFVFAAPPD
RDGKRVYDGL YSRKILVRHF SDPLLAHGMR ISIGTREEME ATLAALKEIG
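
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below translates only the first three and the last four blocks of the gene sequence as a spot check. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons); the `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are the same for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 33 bases of the gene sequence, copied from the record.
first_blocks = "ATGCTTCCCT TTCGCTCAAA TATTGCCGCC"
last_blocks = "GCGACTCTCG CCGCCCTGAA AGAGATTGGC TAA"

print(translate(first_blocks))  # MLPFRSNIAA
print(translate(last_blocks))   # ATLAALKEIG
```

Both fragments agree with the start and end of the protein sequence listed above.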