Gene Apar_0475 details

Gene Information

Locus tag: Apar_0475
Symbol: pyrG
ID: 8413324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 542650
End bp: 544299
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 46%
IMG OID: 645022043
Product: CTP synthetase
Protein accession: YP_003179497
Protein GI: 257784280
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0504] CTP synthase (UTP-ammonia lyase)
TIGRFAM ID: [TIGR00337] CTP synthase
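
The coordinate and length fields in this record are consistent with one another, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not say so) and that the last codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 542650
end_bp = 544299
gene_length_bp = 1650
protein_length_aa = 549

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print(computed_length, remainder, computed_protein_length)
```

Under these assumptions the computed gene length and protein length agree with the reported 1650 bp and 549 aa.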


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.984094
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGC ACATATTTGT AACAGGCGGC GTTGTTTCTT CTCTTGGAAA AGGAATCACT 
GCTGCATCTC TCGGCCGACT TCTTAAGGCT CGTGGCTATA AGGTCATGAT GCAAAAAGCT
GACCCTTATC TCAATGTTGA TCCAGGCACC ATGAGTCCTT TCCAGCATGG TGAGGTTTTT
GTAACTGAGG ATGGTAAAGA GACTGACCTT GACCTTGGTC ATTATGAGCG TTTTATTGAT
GAGAACCTTA CTAGGGAGTC TAACTTCACC ACTGGTCTCA TTTATCAGTC TTTGATTCAG
CGTGAACGTG CTGGAGACTT TCTTGGTGGT ACAGTTCAGG TAATTCCTCA TGTAACTGAC
GCAATTAAAG CTCGTTTTGC TCGCATTGAA GAGGTTACTA ATGCTGATGT AGTTATCACG
GAGCTGGGTG GCACTATTGG CGATATTGAG TCACAACCTT TTGTTGAGGC AATTCGCCAA
TTCAGAAAAG AGCGAGGCGC AAGTAACGTT GCTATTATTC ACGTCAGTCT TGTTCCTTAT
ATCGCAGCTG CTCATGAGGT CAAGACTAAG CCTACGCAGC ACTCCGTAAA AGAGCTTCGT
TCCCTTGGTA TTCAGCCAGA CTTTATCGTA TGTCGTTCAA GCCATTCTGT GGATGAATCT
ATTCGCGAGA AAATCGCTAA TTTCTGTGAT GTTGATGCAG ATTGCGTTTT TGAGAACAAT
GATTTGCCTT CAATTTACGA CGTCCCAGCG CACTTGGCAG CACAGGGATT TGACAAGAAG
GTACTTGAGC GTCTTGGTCT TGAAGTTCGT CCAAGTGATC TTGGTGGCTG GGAAGCATTT
ACTACTGCTA TGCATAAGGC AAATGCGCTT GAAGACACAA CAAGAATTTA TGTTGTTGGT
AAGTATACGC AGTTACCTGA TGCATATCTT TCCGTTATTG AGGCACTTCA CCACTCTGGT
ATTTTCTACG GCAGACACGT TGATATCCGT CTGGTAAATG GTGAAGAGCT AACAGAAGAA
GACGTGGAGC AAGAGCTTGC CGGCGCAGAT GGTATTTTGG TTCCCGGCGG CTTTGGTCTT
CGTGGTGTAG AAGGCAAGAT GGTTGCTATT CGTCGTGCCC GTGAGCTTAA GATTCCTTAT
CTTGGTGTCT GCCTTGGTAT GCAGATGGCT GTTACTGAGT TTGCTCGTGA TGTTTGTGGA
ATGGAGGGCG CAAATTCAGC AGAGTTTGGT CCAGATACTC CATATCCTGT CATCGATCTT
ATGCCTGATC AGGAGGATAT TACCGATAAA GGCGGTACTA TGCGCCTTGG TTCTTATCCT
TGTAAGGTTG TTGAGGGAAC TCTTGCGCAT GAGGCTTATG GTGACAACTT GGTTTATGAG
CGTCATCGTC ACCGCTATGA GGTTAGCAAC GTATTCCGTA ATCAGCTTGT TGAGGCTGGT
TTGGTAGTTT CCGGCATTTC TCCAGACGAT CGCCTTGTAG AGATGATTGA GCTTCCAGAG
TCTGTTCACC CTTGGTTTGT TGCAAGCCAA GCACACCCAG AGTTCAAGAG CCGTCCAACT
CATCCTGCAC CTTTGTTCCG TGAGTTTGCA CGTGCAGCAA TCGCTCATCA TGAGGGTGTT
GATCGTCATG ATGTTAATCA GACTCTCTAA
 
Protein sequence
MTKHIFVTGG VVSSLGKGIT AASLGRLLKA RGYKVMMQKA DPYLNVDPGT MSPFQHGEVF 
VTEDGKETDL DLGHYERFID ENLTRESNFT TGLIYQSLIQ RERAGDFLGG TVQVIPHVTD
AIKARFARIE EVTNADVVIT ELGGTIGDIE SQPFVEAIRQ FRKERGASNV AIIHVSLVPY
IAAAHEVKTK PTQHSVKELR SLGIQPDFIV CRSSHSVDES IREKIANFCD VDADCVFENN
DLPSIYDVPA HLAAQGFDKK VLERLGLEVR PSDLGGWEAF TTAMHKANAL EDTTRIYVVG
KYTQLPDAYL SVIEALHHSG IFYGRHVDIR LVNGEELTEE DVEQELAGAD GILVPGGFGL
RGVEGKMVAI RRARELKIPY LGVCLGMQMA VTEFARDVCG MEGANSAEFG PDTPYPVIDL
MPDQEDITDK GGTMRLGSYP CKVVEGTLAH EAYGDNLVYE RHRHRYEVSN VFRNQLVEAG
LVVSGISPDD RLVEMIELPE SVHPWFVASQ AHPEFKSRPT HPAPLFREFA RAAIAHHEGV
DRHDVNQTL
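
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a fragment. The sketch below is a hedged illustration, not the database's own pipeline. It builds the codon table from the standard NCBI ordering (translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons) and translates the first and last lines of the gene sequence. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments of the standard genetic code in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments; only its start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACCAAGC ACATATTTGT AACAGGCGGC GTTGTTTCTT CTCTTGGAAA AGGAATCACT"
last_line = "GATCGTCATG ATGTTAATCA GACTCTCTAA"

print(translate(first_line))
print(translate(last_line))
```

Each full line of the gene sequence holds 60 nucleotides, so every line starts in frame. The first fragment translates to the first 20 residues of the protein sequence, and the last fragment yields its final nine residues followed by the stop codon.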