Gene Cphamn1_2444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2444 
Symbol 
ID6376139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2606217 
End bp2607203 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content50% 
IMG OID642684922 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001960820 
Protein GI189501350 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTACA AGACGGCAGG AGTTGATATC AGTGCTGGAG AAGAGTTTGT CCGATTGATC 
AAGCCGGAGG TGCGTCAGAC CTTTACCAGT GGTGTTCTGA CCGATATCGG CGCGTTCGGG
GGTTTTTTTC AGCCTGATTT TTCCTCATAT ACGAGTCCTG TGCTGGTCAG CAGTATAGAC
GGAGTCGGTA CGAAACTGAA GGTTGCGGCG GAAATGGGGC GATATGACAC CATTGGCGCC
TGTCTGGTGA ATCACTGTGT GGATGATATT CTTGTGTGTG GCGCGAAACC GTTGTTTTTT
CTTGACTACT ATGCTTGTGG GAAGCTTATA CCCGAAATGG CAGCGGATAT TGTCAAGGGA
ATGGTTGCGG CATGCAAAGA GAACTCCTGT GCTCTCATCG GAGGTGAAAC CGCTGAAATG
CCCGGGGTTT ACGCCACAGA TGATTTTGAT CTCGCAGGGA CGATTGTCGG TGTTGTCGAC
CAGAGCAGAA TCATTAACGG CGCGGCAATA TCCGAGGGAG ACGTCATGAT AGGGATTGCC
TCGAACGGAT TGCATACAAA CGGCTTCTCT CTGGCTCGCA AGGTTTTCGA GGGAAAGCTT
CGACATACAT TTGAGGGACT TGAAGGCAGT GTCGGTGATG AGCTTCTGAA GGTTCACCGC
TCGTATCTTC CAGCAATCGG GCCACTGCTT TCTTCAGAAG ATATTCACGG TATGTCTCAT
ATTACAGGAG GAGGTCTGAC AGGAAACACC ATGCGGATTA TACCCGACGG CCTCCGGCTT
GATGTCGACT GGAAATCCTG GCCGGAGCCG GTGATTTTCG ATATCATACG TAAAGAGGGA
AGGGTTCCCG AAGAGGATAT GCGAAGAACC TTTAATCTTG GCGTAGGCCT GGTGATGATT
GTCGCCGAAT CGTCCGTTGA GGGCATAATG GAGGATTTGA CATCGAAGCA GGAAAATGCT
TACATTATAG GCCGGGTCGT TGCCTGA
 
Protein sequence
MDYKTAGVDI SAGEEFVRLI KPEVRQTFTS GVLTDIGAFG GFFQPDFSSY TSPVLVSSID 
GVGTKLKVAA EMGRYDTIGA CLVNHCVDDI LVCGAKPLFF LDYYACGKLI PEMAADIVKG
MVAACKENSC ALIGGETAEM PGVYATDDFD LAGTIVGVVD QSRIINGAAI SEGDVMIGIA
SNGLHTNGFS LARKVFEGKL RHTFEGLEGS VGDELLKVHR SYLPAIGPLL SSEDIHGMSH
ITGGGLTGNT MRIIPDGLRL DVDWKSWPEP VIFDIIRKEG RVPEEDMRRT FNLGVGLVMI
VAESSVEGIM EDLTSKQENA YIIGRVVA