Gene GSU1895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1895 
SymbolpyrG 
ID2686232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2072273 
End bp2073883 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content60% 
IMG OID637126586 
ProductCTP synthetase 
Protein accessionNP_952944 
Protein GI39996993 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACCA AGTTCATCTT CGTAACCGGT GGCGTTGTGT CCTCCATCGG CAAAGGGCTT 
GCCTCGGCGT CGCTCGGGGC GCTGCTGGAG TCCCGCGGCC TGCGGGTAAC CATGCAGAAG
CTGGACCCTT ACATCAACGT TGACCCCGGC ACCATGTCGC CGTTCCAGCA CGGCGAGGTT
TTTGTCACCG ACGATGGCGC CGAGACCGAC CTGGACCTGG GGCACTACGA ACGGTACACC
TCTGCCCGGC TCTCCAAACG GAGCAATTTC ACCACGGGCC AAGTCTATTT TTCGGTCATC
GAGAAGGAAC GGCGCGGCGA TTATCTGGGG GGGACCGTGC AGGTTATTCC CCATATCACC
GACGAGATCA AGCACAAGAT CCTGGAGAAC GCCAAAGGGG CCGATGTGGC CATCGTGGAG
GTCGGGGGAA CGGTCGGCGA CATCGAATCG CTCCCGTTCC TGGAGGCGAT CCGGCAGTTC
AAGGCGGACC GGGGGGCGGG GAACGTTCTT TACATTCACG TGACGCTCGT GCCCCACATC
AAGACGGCCG GTGAGTTGAA GACCAAGCCG ACGCAGCACT CCGTCAAGGA GTTGCGCGAG
ATCGGTATCC AGCCCGACAT TCTCATCTGC CGTTGCGAGA TGGAGCTGCC CCGGGACATG
AAGGCCAAGA TCGCCCTGTT CTGCAACGTG GAAGAGAAGG CGGTCATCAC CTCTACCGAT
GCGGAACATA TCTATGCGGT ACCCCTGGCG CTTCACAAGG AAGGTCTCGA CGAGCAGGTA
GTAGAAAAGC TCAACATCTG GACCAAGGCG CCTGATCTCA GTCCCTGGCA CAGTGTTGTG
GAAAAACTTC GCTCGCCCCT GCGGGGAGAG GTCCGCATTG CCATCGTCGG CAAGTACGTT
AATCTCACCG AGTCATACAA GTCCCTGTCC GAAGCCCTTA CCCACGGCGG GATCGCCAAC
GACTGCCGGG TGGTTCTCAC CTATCTGGAC TCCGAGCGGA TCGAAAGCGA GGGGATCGGC
AGCTCCTTCG ATGACATCGA TGCCATCCTG GTGCCGGGAG GGTTCGGCGA GCGGGGAACC
GAGGGCAAGA TCAAGGCCAT CGAGTACGCC CGCACCCAAA AGATTCCTTT CTTCGGCATC
TGTCTCGGCA TGCAGATGGC GGTGGTGGAG TATGCCCGCA ACGTTTGTGG GCTCGAAGAT
GCCTGTTCCA GCGAGTTCCG TCCCGATTGC GCCAACCCGG TCATCAGTCT GATGGAGGAA
CAACGCGACA TCGACCGGCT GGGCGGCACC ATGCGGCTCG GTGCCTATCC CTGCAGCCTT
ACCAAGGGAA CTTTTGCCCA AAAAGCGTAC GGGTCCCTGG AAATATCCGA ACGTCACCGT
CACCGCTACG AGTACAATAA CGCCTTCCGT GAGACCCTTG TGGCCAATGG CCTCGTCGTG
TCGGGGCTCT ACAAGGAAGG GGATCTGGTG GAGATCGTGG AGGTCGCCGA CCACCCTTGG
TTCCTCGGCT GCCAGTTCCA CCCGGAATTC AAGTCAAAGC CGCTTAACCC CCACCCTCTG
TTCAGGGCAT TTATCGCAGC CGCCCTTGAT CGGAAAGACA AAAGGCGATA G
 
Protein sequence
MKTKFIFVTG GVVSSIGKGL ASASLGALLE SRGLRVTMQK LDPYINVDPG TMSPFQHGEV 
FVTDDGAETD LDLGHYERYT SARLSKRSNF TTGQVYFSVI EKERRGDYLG GTVQVIPHIT
DEIKHKILEN AKGADVAIVE VGGTVGDIES LPFLEAIRQF KADRGAGNVL YIHVTLVPHI
KTAGELKTKP TQHSVKELRE IGIQPDILIC RCEMELPRDM KAKIALFCNV EEKAVITSTD
AEHIYAVPLA LHKEGLDEQV VEKLNIWTKA PDLSPWHSVV EKLRSPLRGE VRIAIVGKYV
NLTESYKSLS EALTHGGIAN DCRVVLTYLD SERIESEGIG SSFDDIDAIL VPGGFGERGT
EGKIKAIEYA RTQKIPFFGI CLGMQMAVVE YARNVCGLED ACSSEFRPDC ANPVISLMEE
QRDIDRLGGT MRLGAYPCSL TKGTFAQKAY GSLEISERHR HRYEYNNAFR ETLVANGLVV
SGLYKEGDLV EIVEVADHPW FLGCQFHPEF KSKPLNPHPL FRAFIAAALD RKDKRR