Gene Acid345_3670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3670 
SymbolpyrG 
ID4072273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4340071 
End bp4341747 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content59% 
IMG OID637985693 
ProductCTP synthetase 
Protein accessionYP_592745 
Protein GI94970697 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0504] CTP synthase (UTP-ammonia lyase) 
TIGRFAM ID[TIGR00337] CTP synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCAA AGTACATCTT TGTCACCGGC GGAGTTGTGT CGTCGCTAGG TAAGGGATTG 
GCAGCAGCTT CCATCGGCTG TCTGCTGGAA ATGCGTGGTC TCAAGGTCAA CATGCAAAAG
TTTGACCCGT ATCTCAATGT CGATCCCGGG ACCATGTCGC CGTTCCAGCA CGGCGAAGTC
TTCGTCACCG ACGACGGCGC CGAGACCGAT CTCGACCTGG GCCACTACGA GCGCTATACG
CATAGCAAGC TCACGCGCGA GAACAACTGG ACCACCGGCC GCATTTACGA GCAGATCATC
ACCAAGGAAC GCCGCGGCGA TTATCTCGGC AAGACCGTCC AGGTCATTCC GCACGTCACC
AACGAGATCA AGGCCGCTAT GAAGCGTGCC GCCGTGGATG TGGACGTTGC TATCGTCGAA
ATTGGCGGCA CAGTTGGCGA CATCGAATCG CTGCCATTCA TCGAAGCCAT TCGCCAGATG
CGCCAGGAGC TCGGCCGCGA CAACACGCTC TTCGTACACC TCACACTGGT GCCCTACATT
GCCGCTGCCG GCGAGTTGAA GACCAAGCCC ACGCAGCACT CGGTGAAAGA GCTGCTCAGC
ATCGGAATTC AGCCCGACAT ATTGCTGTGC CGCACCGACC GCTTCCTGTC GAAAGACATC
AAAGGCAAGA TCGCGCTCTT CTGCAACGTT GAGGACGAAG CCGTCATCAC CGCGAAAGAC
GTGGCTTCCA TCTACGAAGT TCCGCTCGGC TTTCATCATG AAGGCGTCGA TCGCCTGGTG
ATGAAATATC TGCGTCTCGA TGCGAAGGAA CCCGACCTCA CCCGCTGGCA GGACATCGTC
CATCGCGTCT ACAACCCGAA AGACGAAGTC ATCATTGGCA TCATCGGGAA ATACGTGGAG
TACGAAGACT CCTACAAGTC GCTGAAGGAA GCGCTCGTCC ACGGATCGCT CGCGCACAAC
CTCAAGCTTA ATGTCACGTG GATCGAAGCA GAGGGCCTCG AGACCAAGGA CGAGAGCTAT
TACGAACAGC TTCGCCATGT GGACGGCATC CTCGTTCCCG GCGGCTTTGG CAAGCGCGGC
ATCGCCGGCA TGTTGAATGG CATCCGCTTC GCGCGGGAGC ACAAGGTCCC CTACTTCGGC
ATCTGTCTCG GCATGCAGAC GGCCTCGATC GAATTCGCGC GCAACGTCTG CGGTCTCGAA
GACGCCAACT CCAGTGAGTT CGATCCCGCT ACCCCGCACC GCGTCATCTA CAAACTGCGC
GAACTGCGCG GCGTGGAAGA ACTCGGCGGC ACTATGCGCC TCGGCGCGTG GGCTTGCAAA
CTCGAACCCG GCTCGCACGC CGCGAAAGCC TACGGAACCA CTGAGATCAG CGAGCGACAT
CGCCACCGCT ACGAATTCAA CCAGGAATAC CGCGAGCAGA TGGCCGCTGC CGGACTCAAA
TTCACTGGCA CCACGCCCGA CGGCACCTAC ATCGAGATCG TCGAGCTCGA CCAGAATGAG
CATCCGTACT TCCTCGGCTG CCAGTTCCAC CCCGAATTCA AGTCGAAGCC GCTCGAGCCG
CATCCACTTT TCAAGGCATT CATCGGCGCA TCGTACGAGC ATCGCATGAA GCGCACACAC
ACCAAGGAGC GCGAGGAAGA GTCGGTCTTC CTGCGTCCGG AGCGGGTAGG GAAGTAG
 
Protein sequence
MSAKYIFVTG GVVSSLGKGL AAASIGCLLE MRGLKVNMQK FDPYLNVDPG TMSPFQHGEV 
FVTDDGAETD LDLGHYERYT HSKLTRENNW TTGRIYEQII TKERRGDYLG KTVQVIPHVT
NEIKAAMKRA AVDVDVAIVE IGGTVGDIES LPFIEAIRQM RQELGRDNTL FVHLTLVPYI
AAAGELKTKP TQHSVKELLS IGIQPDILLC RTDRFLSKDI KGKIALFCNV EDEAVITAKD
VASIYEVPLG FHHEGVDRLV MKYLRLDAKE PDLTRWQDIV HRVYNPKDEV IIGIIGKYVE
YEDSYKSLKE ALVHGSLAHN LKLNVTWIEA EGLETKDESY YEQLRHVDGI LVPGGFGKRG
IAGMLNGIRF AREHKVPYFG ICLGMQTASI EFARNVCGLE DANSSEFDPA TPHRVIYKLR
ELRGVEELGG TMRLGAWACK LEPGSHAAKA YGTTEISERH RHRYEFNQEY REQMAAAGLK
FTGTTPDGTY IEIVELDQNE HPYFLGCQFH PEFKSKPLEP HPLFKAFIGA SYEHRMKRTH
TKEREEESVF LRPERVGK