Gene Information |
Locus tag | TM1040_2294 |
Symbol | |
ID | 4078478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2412716 |
End bp | 2413876 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638007616 |
Product | carbamoyl phosphate synthase small subunit |
Protein accession | YP_614288 |
Protein GI | 99082134 |
COG category | [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
|
|
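The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the Gene Length counts the terminal stop codon; both are assumptions, although they are consistent with the values in this record. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return length_bp // 3 - 1


length_bp = gene_length_bp(2412716, 2413876)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 1161 386
```

Both results agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.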
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGAAA CCGTTTCGCC GAAGCCAACC GCCTGCCTGG CCCTTGCAGA TGGCACCGTG TTCTATGGAC ACGGGTTTGG CGCCACCGGG CGATGCGTGG CCGAGCTGTG CTTTAACACC GCAATGACCG GCTATCAGGA AATCATGACC GACCCTTCCT ATGCAGGTCA GGTCGTCACC TTCACCTTCC CCCACATCGG CAACACCGGC GTCACCCCCG AGGATGACGA AACCGCCGAC CCGGTGGCCG CAGGCATGGT GGTGAAATGG GATCCGACGC TGTCCTCCAA CTGGCGCGCC ACCGAAGAGC TGAAGTCCTG GCTCACCCGC ACGGGCCGCA TCGCCATCGG CGGCGTGGAC ACCCGCCGTC TGACTCGCGC GATCCGCCAG CAGGGCGCGC CGCATGTGGC GATGGAGCAT AACCCGGACG GGAATTTCGA TCTTGAGGCG CTGGTCGCCG CCGCCCGCGC CTGGCCCGGC CTTGAGGGCA TGGACCTCGC CAAGGACGTG ACCTGCGCGC AGTCCTACCG CTGGGATGAG ATGCGTTGGG CCTGGCCCGA GGGCTACACC CGTCAGGAAG AGCCCAAGCA CAAGGTGGTC GCCATCGACT ATGGTGCCAA GCGCAACATC CTGCGCTGCC TCGCCTCGGC GGGCTGCGAT GTCACCGTGC TGCCGGCCAC CGCAACCTCG GAAGAGGTAC TGGCCCATGG CCCTGATGGT GTGTTCCTCT CCAATGGCCC CGGCGACCCG GCCGCAACCG GCGCATACGC TGTGCCGATG ATCAAGGAAA TCCTGGATAA GACCGACTTG CCGGTCTTTG GGATCTGTCT GGGCCACCAG ATGCTCGCAC TCGCTTTGGG GGCCAAGACC ACCAAGATGA ACCACGGCCA CCACGGCGCC AACCACCCGG TCAAGGAACA CGGCACCGGC AAGGTGGAGA TCACGTCGAT GAACCACGGC TTTGCAGTGG ATGCTCAAAC CCTGCCCGAG GGCGTCGAAG AGACCCATGT CTCGCTGTTT GACGGCTCCA ACTGCGGCAT TCGCATGACC GATCGCCCGG TCTACTCCGT GCAGCACCAC CCCGAGGCCA GCCCCGGCCC GCAGGACAGT TTCTATCTGT TCGAGCGCTT TGCAGAGGCG ATGGCCGCGC GCAAGGCCTG A
|
Protein sequence | MVETVSPKPT ACLALADGTV FYGHGFGATG RCVAELCFNT AMTGYQEIMT DPSYAGQVVT FTFPHIGNTG VTPEDDETAD PVAAGMVVKW DPTLSSNWRA TEELKSWLTR TGRIAIGGVD TRRLTRAIRQ QGAPHVAMEH NPDGNFDLEA LVAAARAWPG LEGMDLAKDV TCAQSYRWDE MRWAWPEGYT RQEEPKHKVV AIDYGAKRNI LRCLASAGCD VTVLPATATS EEVLAHGPDG VFLSNGPGDP AATGAYAVPM IKEILDKTDL PVFGICLGHQ MLALALGAKT TKMNHGHHGA NHPVKEHGTG KVEITSMNHG FAVDAQTLPE GVEETHVSLF DGSNCGIRMT DRPVYSVQHH PEASPGPQDS FYLFERFAEA MAARKA
|
| |
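The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch rather than the method used to generate this record. It assumes the sequence is pasted as space-separated blocks, as shown above, and that codons inside the reading frame follow the standard amino acid assignments. Translation table 11 shares these assignments and differs mainly in which start codons are permitted, which does not matter here because the gene begins with ATG. All names in the code are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGGTCGAAA CCGTTTCGCC GAAGCCAACC"))  # MVETVSPKPT
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` allows the GC content and Protein Length fields to be checked in the same way.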