Gene Apar_0189 details

Gene Information

Locus tag: Apar_0189
Symbol:
ID: 8413037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 223435
End bp: 225024
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 52%
IMG OID: 645021761
Product: GMP synthase, large subunit
Protein accession: YP_003179216
Protein GI: 257783999
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0519] GMP synthase, PP-ATPase domain/subunit
TIGRFAM ID: [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
            [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit
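
The coordinate and length fields above are mutually consistent, and the short Python sketch below shows one way to check that. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 223435
END_BP = 225024
GENE_LENGTH_BP = 1590
PROTEIN_LENGTH_AA = 529


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 225024 - 223435 + 1 = 1590 bp, and 1590 / 3 - 1 = 529 aa.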


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0433911
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAGT CCAGGCCCAA GCAGTTTGTT GCAGTTCTAG ACTTTGGTGC CCAGTACGGC 
CAGTTAATTG CACGTCGCGT GCGCGATCTT AACGTCTACT CCGAGATTGT TCCTTGTGAT
ATTTCCGCAG ATGAGCTTCG TGAGCTTAAT CCATCTGCGC TTATTTTGTC CGGCGGCCCT
GCTTCCGTTT ACGCCGAGGA CGCGCCAAAG ATTGACCCAG AGATTCTGGA GCTTGGTATT
CCTGTCTTTG GTTTCTGCTA TGGACAGCAG ATTATGGCGG TTACCCTGGG CGGCACTGTT
GGACATACCG AGAAGGGCGA GTATGGCCCA GCTCATCTGA CTCGTGCAGG GGAGAGTCGT
ATTTTTGACG GCACCGCTGA GCAGCAGACC GTTTGGATGA GCCACCGCGA CGCTGTCTCT
GAGGTTCCAG ATGGCTTTAC CGTTACCGCT TCTACCGACG TTTGCCCTAT TGCAGCTATG
GAAAACGCTG CTAAGAATCT ATATTCAACT CAGTTTCATC CAGAGGTCAA CCACACCGAA
TGTGGTTCCC AGATGCTTTC TAACTTTCTA TTTAATATCT GTGGCTTTGA AAAGACTTGG
ACTATGGACA ACATCATTGA GCAGAAAGTG GAGGAGATTC GCCAGAAGGT TGGTAATGGA
CGTGTCATTT TGGCGCTCTC CGGTGGCGTA GACTCTTCCG TTGTTGCCGC TCTTGTCCAT
CGTGCTATTG GTGATCAGCT GACCTGCGTG TTTGTCAATC ACGGTATGCT TCGTAAGGGC
GAACCAGAGA TGGTTGAGCA GGTCTTCTGC AAGCAGTTTA ACGTGCCTTT GATTCACGTT
CACGCGGAGG AGCGCTACGC AGAGCTTTTA GCTGGCGTTA CTGAGCCAGA GAAGAAGCGT
CGTCTGATTG GTACCGAGTT CTGGAAGGTC TTCTTTGATG AGGCTCAGAA GCTGGATGGC
GTTCAGTTCC TGGCACAGGG CACCATTTAT CCTGACATTA TTGAGTCTGG CGCTCGTAAG
ACGGGCGGTA AGGCTGCAAC CATCAAGAGC CACCACAACC TGATTCCATT CCCAGAAGGC
GTTCACTTTG ACCTGATTGA GCCTCTGGAT CACTTCTTCA AGGACGAGGT CCGCGCGCTG
GGCGTTTCTC TTGGTCTGCC AGAGAACCTT GTCTACAGAC AGCCTTTCCC AGGTCCTGGT
CTTGCTATCC GCATCATTGG TGACGTTACC CCAGAAAAGC TGGAAATTCT TCGCAACGCA
GATGCAATTG TCCGAGAAGA GATTGACGCT TACAATGCTC AGCTCTTTGA CGAGACAGGC
GATCGTAACT CCGAGCACAG TGTTTGGCAG TACTTTGCTG TGCTACCCGA CATTAAGTCC
GTTGGTGTTA TGGGTGATGA GCGCACGTAT GCTCGTCCAG TTATCCTGCG CGCCGTTGAG
TCCAGTGACG CTATGACCGC TGACTGGGCA AAGCTCCCAT ATGAGCTGCT AACTCGCATT
TCTTCTAGGA TTGTTAGCGA GGTTGCTGGT GTTAACCGCG TAGCATACGA CATTACTCCT
AAGCCACCTG CGACTATTGA GTGGGAGTAG
 
Protein sequence
MSESRPKQFV AVLDFGAQYG QLIARRVRDL NVYSEIVPCD ISADELRELN PSALILSGGP 
ASVYAEDAPK IDPEILELGI PVFGFCYGQQ IMAVTLGGTV GHTEKGEYGP AHLTRAGESR
IFDGTAEQQT VWMSHRDAVS EVPDGFTVTA STDVCPIAAM ENAAKNLYST QFHPEVNHTE
CGSQMLSNFL FNICGFEKTW TMDNIIEQKV EEIRQKVGNG RVILALSGGV DSSVVAALVH
RAIGDQLTCV FVNHGMLRKG EPEMVEQVFC KQFNVPLIHV HAEERYAELL AGVTEPEKKR
RLIGTEFWKV FFDEAQKLDG VQFLAQGTIY PDIIESGARK TGGKAATIKS HHNLIPFPEG
VHFDLIEPLD HFFKDEVRAL GVSLGLPENL VYRQPFPGPG LAIRIIGDVT PEKLEILRNA
DAIVREEIDA YNAQLFDETG DRNSEHSVWQ YFAVLPDIKS VGVMGDERTY ARPVILRAVE
SSDAMTADWA KLPYELLTRI SSRIVSEVAG VNRVAYDITP KPPATIEWE
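
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below is a minimal, dependency-free illustration of that step, not the tool that produced this record. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last rows of the gene sequence are embedded to keep the example short.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# First and last rows of the gene sequence, copied from this record.
FIRST_ROW = "ATGAGTGAGT CCAGGCCCAA GCAGTTTGTT GCAGTTCTAG ACTTTGGTGC CCAGTACGGC"
LAST_ROW = "AAGCCACCTG CGACTATTGA GTGGGAGTAG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        residue = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    print(translate(FIRST_ROW))
    print(translate(LAST_ROW))
```

Translating the first row gives MSESRPKQFVAVLDFGAQYG and the last row gives KPPATIEWE, matching the start and end of the protein sequence above. Passing the full gene sequence to `gc_percent` would be one way to reproduce the GC content field, assuming that field is computed over the gene itself.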