Gene Information |
Locus tag | Mpe_A1261 |
Symbol | |
ID | 4785838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1357698 |
End bp | 1358849 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640089827 |
Product | carbamoyl-phosphate synthase small subunit |
Protein accession | YP_001020458 |
Protein GI | 124266454 |
COG category | [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
|
|
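The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Mpe_A1261.
length_bp = gene_length(1357698, 1358849)   # 1152
length_aa = protein_length(length_bp)       # 383
```

With the record's values, 1358849 - 1357698 + 1 = 1152 bp, and 1152 / 3 = 384 codons, one of which is the stop codon, leaving 383 aa.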
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00672288 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGCTGCCAG ATCTGCCCCT CGCTCTACTT GCTCTCGCAG ACGGCACGGT CTTTCAAGGT ACCTCGATCG GCGCGGCCGG TCGCACGGTC GGCGAGGTGG TGTTCAACAC CGCGCTCACC GGCTACCAGG AAATCCTCAC CGACCCGAGC TACTGCCGGC AGATCGTCAC GCTGACGTAT CCGCACATCG GCAACACCGG CGTCAACGAG GAGGATGTCG AAGCCTCGAA AGTCCATGCC GCCGGCCTGA TCATCAAGGA CCTGCCAGTC CTCGACTCGA ATTTCCGCAA GACCCGCACG CTGTCGCAGT ACCTGCGCGA CGAAGGAACG GTGGCGATCG CCGACATCGA CACCCGCAAG CTCACCCGCA TCCTGCGCAC CGGCGGGGCC CAGAACGGCT GCATCGTGGC GCTGCCGCGC GGGGAGACCT TCACCGAGGC GCTGGTCGCC GACGCGGTGG CCCGGGCGCG CAGTGCGCAG AGCATGGCAG GCCTGGATCT CGCCAAGGTC GTGAGCACTG CGATGCCCTA CGGTTGGACC GAGACCGAGT GGTCGCTGGG CACCGGCTAC GGCACCCAGG CGGCGCCGAA GTTCCACGTC GTCGCCTATG ACTTCGGCGT CAAGCGCAAC ATCCTGCGCA TGCTGGCCAG CCGCGGCTGC CAGGTCACGG TGGTGCCGGC CAAGACGCCG GCCGCGGAGG CGCTGCTGCT GAAGCCGGAC GGCATCTTCC TCAGCAACGG CCCGGGCGAC CCCGAGCCCT GCGACTACGC GATCGAGGCA ACGCGCACGC TGATCGACAC CGGCCTGCCG GTGTTCGGCA TCTGTCTGGG TCACCAGATC ATGGCGCTGG CTTCGGGGGC GAAGACCTTC AAGATGAAGT TCGGGCACCA CGGCGCCAAC CACCCGGTAA AGGACCTGGA CGACGGCCGC GTCAGCATCA CGAGCCAGAA CCACGGTTTC GCGGTCGACG AGAAGTCCCT GCCGGCCACC CTGCGCCCCA CGCACGTGAG CCTGTTCGAC GGCACGCTGC AGGGGCTGGC GCGCACCGAC AAGCCCGCCT TCTGCTTCCA GGGCCACCCG GAAGCGTCTC CCGGGCCGCA CGACATCGGC TACCTGTTCG ACCGCTTCAT TGCGCTGATG GAAAAGAACT GA
|
Protein sequence | MLPDLPLALL ALADGTVFQG TSIGAAGRTV GEVVFNTALT GYQEILTDPS YCRQIVTLTY PHIGNTGVNE EDVEASKVHA AGLIIKDLPV LDSNFRKTRT LSQYLRDEGT VAIADIDTRK LTRILRTGGA QNGCIVALPR GETFTEALVA DAVARARSAQ SMAGLDLAKV VSTAMPYGWT ETEWSLGTGY GTQAAPKFHV VAYDFGVKRN ILRMLASRGC QVTVVPAKTP AAEALLLKPD GIFLSNGPGD PEPCDYAIEA TRTLIDTGLP VFGICLGHQI MALASGAKTF KMKFGHHGAN HPVKDLDDGR VSITSQNHGF AVDEKSLPAT LRPTHVSLFD GTLQGLARTD KPAFCFQGHP EASPGPHDIG YLFDRFIALM EKN
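The gene sequence starts with GTG while the protein starts with M, which is expected under translation table 11: an alternative start codon at the initiator position is read as methionine. The following is a minimal sketch of that translation and of a GC-content calculation, assuming the 10-base blocks are only display formatting. The helper names are hypothetical, and the start-codon set is the one listed for NCBI translation table 11 as best I know it; treat it as an assumption to verify.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base display blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate an unspliced CDS; the initiator codon is read as M."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not an accepted start codon")
    residues = ["M"] + [CODON_TABLE[c] for c in codons[1:]]
    if residues[-1] == "*":          # drop the terminal stop codon
        residues.pop()
    return "".join(residues)


# First seven codons of the gene sequence plus its stop codon (TGA).
prefix = translate_cds("GTGCTGCCAG ATCTGCCCCT C" + "TGA")   # "MLPDLPL"
```

Passing the full gene sequence to `translate_cds` should reproduce the protein sequence listed above, and `gc_content` can be compared against the GC content field.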