Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_09091 |
Symbol | carA |
ID | 4717616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 781067 |
End bp | 782206 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640078622 |
Product | carbamoyl phosphate synthase small subunit |
Protein accession | YP_001009300 |
Protein GI | 123968442 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.347132 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAATC CATATAAGAA AAACGCGAAA TTAGTTTTAA GTAATGGTAT TGTATTTCCG GGATTCTTTT TTGGTGCTTC TGGTACTGCA GTTGGTGAAA TAGTTTTTAA TACGGGAATG ACCGGATATC AGGAAGTTAT TACTGATCCT AGTTATTATG GACAAATTTT AACGTTTACT TATCCAGAGA TTGGAAATAC TGGTATTAAT TTTGAAGATT CAGAATCCAA TATTAATATC AAAGGTATAA TTGTTAGAAA TTTTTCATCT AATAACAGCA ATTGGAGATC TAAAAAGAAT TTTAATCAAT GGTTAGTGGA GAAAAATATC ATTGGTCTAT ACGGAATTGA TACAAGGGCT CTTGTTAAGA TTTTAAGATC TACTGGCGCA ATGAATGGAG TTATTACCTC TCTAGATAAA ACTGATGAAA GTTGTTTAAA GATAATTCAG GATACACCAA AGATGGAGGG ATTAAATTTA TCAAAAGTTG TTTCAACAAA GCAACAATAT TTATGGAATA GTCATACACA AACAGTTTTT GATTTAAGAA AAAGATATGA TGAATCTTCT AAAAAGTTAA AAATAGTGGC AATTGATTTT GGAATTAAAA ATTCAATTTT AAATAGACTC GTATCCCATG GTTGTGAAGT TTTGGTTTTA CCTTCTCGAT CTTCTCTAAA TGATGTCCTG TCTAACAAGC CGGATGGTAT ATTTTTCTCA AATGGTCCAG GCGATCCTTC TACTGTTTCT GAAGGTATAG ACCTAGCAAA ATCACTCATA GAATATGGTG AAATTCCTAT GTTTGGTATA TGCCTTGGCC ACCAAATATT TGGATTAGCA CTAGGTGGCT CAACTTATAA ATTATCTTTT GGCCATCGTG GTTTAAATCA CCCTTGTGGT CAAAATAACA AGATAGAGAT AACTAGTCAG AACCACGGTT TTGCTATTGA TCCTAATTCT CTACCAAAGA ATTTAGTTAG AATAACCCAT TACAACCTTA ACGATAATAC TGTGGCTGGC CTAGAAGTTA ATAATAAGCC AATATTTAGT GTGCAATATC ATCCAGAAGC AGGACCTGGC CCACATGATT CAGATTATTT ATTTAAAAAA TTTGTTTCTC TAATGTTAGA AAGATGTTGA
|
Protein sequence | MINPYKKNAK LVLSNGIVFP GFFFGASGTA VGEIVFNTGM TGYQEVITDP SYYGQILTFT YPEIGNTGIN FEDSESNINI KGIIVRNFSS NNSNWRSKKN FNQWLVEKNI IGLYGIDTRA LVKILRSTGA MNGVITSLDK TDESCLKIIQ DTPKMEGLNL SKVVSTKQQY LWNSHTQTVF DLRKRYDESS KKLKIVAIDF GIKNSILNRL VSHGCEVLVL PSRSSLNDVL SNKPDGIFFS NGPGDPSTVS EGIDLAKSLI EYGEIPMFGI CLGHQIFGLA LGGSTYKLSF GHRGLNHPCG QNNKIEITSQ NHGFAIDPNS LPKNLVRITH YNLNDNTVAG LEVNNKPIFS VQYHPEAGPG PHDSDYLFKK FVSLMLERC
|
| |