Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1473 |
Symbol | |
ID | 4241679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2235212 |
End bp | 2236372 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638106626 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_721236 |
Protein GI | 113475175 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0646926 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAAAAA TGAAAATGAA TGAAGAGGAA AAAAAACAAC AAAACAAAAT TCAGCGAGTG GGTGTTGTTG GTGGTGGTCA ATTAGCATGG ATGATGGGGG ATGCAGCAAA AAAACTAGGA GTAGATTTAA TTATTCAAAC TCCTCATCAA GATGACCCAG CAGTATCTAT TGCAAAGGAT ATAATTTTGG CAGAAATTGA TGACCCTCAG GCAACTACTA AGTTAGCAAA TATTTGTGAT GTGATTACCT TTGAGAATGA GTTTATTAAT ATAGATGAGT TATCATATCT TGCTGAAAAG AATGTAATTT TTCGTCCTAG TTTATCTGTG TTGAAGCCGT TATTAGATAA ATATGAACAG TTATGTTATT TACGATATTT GGGTTTACCT GTACCGAACT TTTGGGAGTG GGGAATAGAG ATTGAGCCTT TATCTTTCCC TTTGGTATTG AAAGCTCGTC GTCATGGTTA TGACGGTCAG GGTACTTTTA TTATTAAGAA TATTGAGAGT TTAAAATCTC AGGGAAATTC AGAATTTTTC ATACAAGAAT TTATTCCTTT TGAACGAGAG GTTGCTGTTA TTGCTGCTCG TGGAGTTACT GGGGAAGTTA AGGTTTATCC TGTGGTAGAA ACTCAACAAG AAAACCAGGT TTGCCAGCGG GTTTTTGTAC CTGATGAAAA TTTAGAATTA GTAACAGAAA TTGAAGAGAT CGCTCAGACT CTCCTTAATA GTTTAGAAGT AGTAGGAGTA TTTGGGATAG AAATGTTTAT TACTAAAGAC AAGAAGGTTT TAATTAATGA AATTGCTCCA AGAACTCATA ACTCTGGTCA TTACAGTTTG GATGCTTGTG AAGTTTCCCA GTTTGAACAA CATTTACGGG CTGTTTGTGG TTTACCTTTA GGTAATACTA CTCTCAAAGT AAGGAGAGCG GTGATGGTAA ATTTATTGGG TTATGAATTT GGGGAAAATT ACTACTTGAC AAAACGGCAA ATGTTAGAAA AAATTCCTCA TGCTTCGGTT TGGTGGTATG GCAAAACAGA ATCTCGCCCA GGACGGAAGT TAGGTCATGT GACTGTTTTG CTAGATGAGG AAAATTTTGA AATTATGGGT CGCAAGGGTG AGGCGATCGC TAATAAAATA GAGAATATCT GGTACACCTA A
|
Protein sequence | MVKMKMNEEE KKQQNKIQRV GVVGGGQLAW MMGDAAKKLG VDLIIQTPHQ DDPAVSIAKD IILAEIDDPQ ATTKLANICD VITFENEFIN IDELSYLAEK NVIFRPSLSV LKPLLDKYEQ LCYLRYLGLP VPNFWEWGIE IEPLSFPLVL KARRHGYDGQ GTFIIKNIES LKSQGNSEFF IQEFIPFERE VAVIAARGVT GEVKVYPVVE TQQENQVCQR VFVPDENLEL VTEIEEIAQT LLNSLEVVGV FGIEMFITKD KKVLINEIAP RTHNSGHYSL DACEVSQFEQ HLRAVCGLPL GNTTLKVRRA VMVNLLGYEF GENYYLTKRQ MLEKIPHASV WWYGKTESRP GRKLGHVTVL LDEENFEIMG RKGEAIANKI ENIWYT
|
| |