Gene Cphamn1_1049 details


Gene Information

Locus tag: Cphamn1_1049
Symbol:
ID: 6374720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1137419
End bp: 1138909
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 52%
IMG OID: 642683550
Product: inosine-5'-monophosphate dehydrogenase
Protein accession: YP_001959471
Protein GI: 189500001
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0516] IMP dehydrogenase/GMP reductase
TIGRFAM ID: [TIGR01302] inosine-5'-monophosphate dehydrogenase
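
The coordinates and lengths in the record above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated on the page, but both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 1137419
end_bp = 1138909
gene_length_bp = 1491
protein_length_aa = 496


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is assumed to be a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions, 1138909 - 1137419 + 1 gives 1491 bp, which is 497 codons, or 496 residues plus the stop codon.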


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0106058
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAAAA TACTCTACGA AGCGCTTACG TTTGATGATG TGTTGCTCGT ACCCGCTTAC 
TCGGCGATTC TTCCTAAAGA AACGAGCGTC AAGACCCGCC TCACGAAAAA CATTCAGCTG
AATATTCCGC TGGTCAGTGC GGCTATGGAT ACGGTTACGG AATCTGAGTT GTCAATCGCT
ATCGCTCGCT CCGGCGGTAT AGGTTTCATC CACAAAAACC TGACAATCAG CCAGCAGGCA
AAGGAAGTTG CGAAAGTCAA GCGGTATGAA AGCGGGATTA TCCGCAACCC TGTCACCCTT
TATGAAAACG CGACCGTACA GGCGGCTCTT GACCTGATGC AGAAGCACTC GATATCCGGC
ATTCCGATTA TTGAAGAACC TATAGGGCCT GATGACGCCT CTCTGAAACT TAAAGGAATC
ATCACGAACA GGGACCTTCG CTTCAAACCT TCTCCGGACC AGAAGATTTC AAGCATCATG
ACAAGCAGGA ACCTTATCAC CGCGGATGAA GATATAAACC TCGAAGACGC GGCAGGAATA
CTGCTTGAAA ACAAAATCGA AAAACTGCTG ATAACTGATG GCAAAGGCAA CCTTAAAGGT
TTGATAACCT TTAAGGATAT TCAGAAAAGA AAACTCTATC CTGACTCCTG CAAAGATGAA
GATGGCAGGC TTAGAGCCGG CGCGGCTGTC GGCATTCGTG CAGACACCAT AGACCGGGTA
ACCGCTCTTG TCGAGGCAGG AGTGGATGTT GTCGCGGTCG ATACTGCGCA TGGTCACAGT
AAGGCTGTCT CTGATATGGT GAGAACCATC AAGAAAAGCT TCCCTGATCT TCAGGTGGTC
GCAGGAAACG TTGCTACCGC CGATGCCGTC CGGGATCTCG TTGCGGCAGG CGCGGATGCC
GTCAAAGTCG GTATCGGACC GGGCAGCATC TGTACGACTC GTGTTGTTGC GGGCGTCGGC
ATGCCGCAGC TGACTGCTGT CATGAAATGC GCGGAAGAAG CGGCCAAAAC AGGAACGCCG
CTCATCGCTG ACGGCGGCAT CAAATACAGC GGAGACATCG CCAAGGCTAT TGCCGCAGGC
GCCGATTCAG TAATGATCGG CAGCATCTTT GCCGGAACGG ATGAAAGTCC CGGGGAAACG
ATACTCTATG AAGGGAGGCG CTTCAAGGCA TACAGGGGAA TGGGCTCGCT TGGAGCCATG
TCGGAACCGG AAGGAAGCAG CGACCGGTAT TTCCAGGATG CTTCAAAAGA AAGCAAAAAA
TACGTTCCGG AAGGGATTGA AGGCCGGATA CCGGCAAAAG GAAAGCTGGA AGAGGTTATC
TATCAGCTGA TCGGTGGCCT GAAATCGTCG ATGGGTTACT GCGGTGTACG CTCTACGGAT
GAAATGAAAA ACAACACCAG CTTTGTGCGT ATCACCCAGG CCGGACTGAG AGAGAGCCAT
CCCCATGATG TCAAGATCAC CAAAGAAGCC CCGAACTACT CGGTGTCTTA G
 
Protein sequence
MSKILYEALT FDDVLLVPAY SAILPKETSV KTRLTKNIQL NIPLVSAAMD TVTESELSIA 
IARSGGIGFI HKNLTISQQA KEVAKVKRYE SGIIRNPVTL YENATVQAAL DLMQKHSISG
IPIIEEPIGP DDASLKLKGI ITNRDLRFKP SPDQKISSIM TSRNLITADE DINLEDAAGI
LLENKIEKLL ITDGKGNLKG LITFKDIQKR KLYPDSCKDE DGRLRAGAAV GIRADTIDRV
TALVEAGVDV VAVDTAHGHS KAVSDMVRTI KKSFPDLQVV AGNVATADAV RDLVAAGADA
VKVGIGPGSI CTTRVVAGVG MPQLTAVMKC AEEAAKTGTP LIADGGIKYS GDIAKAIAAG
ADSVMIGSIF AGTDESPGET ILYEGRRFKA YRGMGSLGAM SEPEGSSDRY FQDASKESKK
YVPEGIEGRI PAKGKLEEVI YQLIGGLKSS MGYCGVRSTD EMKNNTSFVR ITQAGLRESH
PHDVKITKEA PNYSVS
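
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the database's own pipeline. It builds the codon table from the standard NCBI ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first 30 and last 12 bases of the gene are used to keep the example short.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 matches the standard code for codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_30_bases = "ATGTCGAAAA TACTCTACGA AGCGCTTACG"  # start of the gene sequence
last_12_bases = "TCGGTGTCTTAG"                       # end of the gene sequence

print(translate(first_30_bases))  # MSKILYEALT
print(translate(last_12_bases))   # SVS*
```

Pasting the full gene sequence into `translate` and `gc_percent` would allow a comparison with the listed protein sequence and the reported GC content.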