Gene Phep_1898 details

Gene Information

Locus tag: Phep_1898
Symbol:
ID: 8253002
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2191622
End bp: 2192761
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 43%
IMG OID: 644935549
Product: phosphoribosylaminoimidazole carboxylase, ATPase subunit
Protein accession: YP_003092168
Protein GI: 255531796
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
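
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
START_BP = 2191622
END_BP = 2192761


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1140
print(protein_length(length_bp))  # 379
```

Both values agree with the Gene Length and Protein Length fields of the record.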


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.255971
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAAC AAATAAGTGA TCTGAAATTG GGAATATTGG GCGGCGGACA ACTCGGCAGA 
ATGCTGATCC AGGAAGCCAT CAATTATAAC CTTACGACCC TGGTTTTAGA TCCGGATACC
GATGCTCCCT GTAAACATCT TGCAAATTAC TTTGAATGTG GCTCTATTAC CGATTTTGAC
ACCGTTTACA ACTTCGGCAA AAAAGCAGAT ATCATTACCA TAGAAATAGA AAAGGTAAAT
ATTGAAGCAT TGGAACAACT GGAAAAGGAA GGAAAACAGG TTTTCCCGCA ATCCCGGGTA
ATCCGCCTGA TCCAGGACAA GGGTGTTCAA AAACAGTTTT TTAAAGAAAA CAACATCCCA
ACAGCACCTT TTCAGCTGGT AAATACCAGA GAAGAGATGC GCCACAGCAG GTTTGCGTTT
CCTTATATAC TTAAACAGCG CCGGGATGGC TACGACGGTA AAGGCGTGAT GAAAATAAAC
CATGCAGCTG ATATTGAGCA GGCCTTTGAT GCGCCCTGCC TGATTGAGGA AATGATAGAC
TTTGAAAAAG AGATTGCCGT TATTGTGGCC AGAAACGCTA ATGGGGACAT GAAAACTTTT
CCGATGGTAG AAATGGAATT CAATGCCGAG GCCAATCTGG TCGAGTTCCT GATCTCTCCT
TCTACTTATC CTGAAGCGCT TCAGAACAAG GCCGAGGTAA TTGCCAAAAA CATCGCTTCC
TCCCTTAACA TCACCGGCTT ACTGGCCGTA GAAATGTTTG TGACCAGGAA TGGGGAGCTG
CTGGTCAATG AGCTGGCACC AAGACCACAC AATAGCGGAC ATCAAACCAT TGAGGGCAAT
TATGTTTCTC AGTTTGACCA GCATTTAAGG GCAATTTTTA ACCTGCCATT GGGCGATACA
CGCAGCATCA GCAATGCAGT GATGATTAAT CTGCTGGGCG AGAAAAACCA TAATGGGGTA
GCCAAATATC AGGGATTGGA AAAAACCATG GCCATTGATG GGGTATATAT CCATCTTTAC
GGTAAAAAAT ACACCAAACC TTTCCGCAAA ATGGGCCATG TTACGGTGGT AGACCAAAAC
CGGGAAAGTG CAGTACAGAA AGCAAATTAT ATTAAAAATA CATTAAAAGT TATTTCATAA
 
Protein sequence
MAKQISDLKL GILGGGQLGR MLIQEAINYN LTTLVLDPDT DAPCKHLANY FECGSITDFD 
TVYNFGKKAD IITIEIEKVN IEALEQLEKE GKQVFPQSRV IRLIQDKGVQ KQFFKENNIP
TAPFQLVNTR EEMRHSRFAF PYILKQRRDG YDGKGVMKIN HAADIEQAFD APCLIEEMID
FEKEIAVIVA RNANGDMKTF PMVEMEFNAE ANLVEFLISP STYPEALQNK AEVIAKNIAS
SLNITGLLAV EMFVTRNGEL LVNELAPRPH NSGHQTIEGN YVSQFDQHLR AIFNLPLGDT
RSISNAVMIN LLGEKNHNGV AKYQGLEKTM AIDGVYIHLY GKKYTKPFRK MGHVTVVDQN
RESAVQKANY IKNTLKVIS
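
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares for internal codons, and it does not handle alternative start codons. `translate_cds` and the constant names are hypothetical helpers.

```python
from itertools import product

# Codon assignments of the standard code (also used by table 11 for
# internal codons), listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping one trailing stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Unknown codons (for example, ones containing N) become "X".
    protein = "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First line of the gene sequence as printed in the record.
first_line = (
    "ATGGCAAAAC AAATAAGTGA TCTGAAATTG "
    "GGAATATTGG GCGGCGGACA ACTCGGCAGA"
)
print(translate_cds(first_line))  # MAKQISDLKLGILGGGQLGR
```

The output matches the first 20 residues of the protein sequence. Passing the full 1140 bp gene sequence to the same function should yield all 379 residues, provided the sequence is pasted without alteration.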