Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_4043 |
Symbol | |
ID | 7104617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 4236093 |
End bp | 4237274 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643477036 |
Product | aminotransferase class V |
Protein accession | YP_002374136 |
Protein GI | 218248765 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATTT ATCTGGATTA TAGTGCGACA ACTCCCCCTC GTTCAGAAGC GATCACCCAA GTGGAGGCGA TCCTCAAACA ACAGTGGGGT AACCCGTCCA GTTTGCATAA CTGGGGACAA CAAGCTGCTA CGATCCTCGA AACCGCGCGT TGGCAAGTGG CTAATTTGAT TAATGCGCCC TCAGCAGATT CTATCATTTT TACCTCCGGG GGAACCGAAG CCGATAATCA TGCCTTGCTA GGCATAGCCA GATCCTACAG TAAGCCACAA CATTTGATTA TTTCCTCGGT GGAACATTCG GCAATTTCCG AAACTGCTCA AATCCTAGCA CAATCGGGGT GGCAAGTCAC GATTTTACCG GTTAACCGTC AGGGAAGAGT GACTCCACTG GAATTGAAAG CAGCTATTCG ACCCAATACG TCCCTAATTT CGATTATTTA TGGCCAAAGC GAAATCGGAA CCATTCAACC CATTGAGGAA TTGGCGAAAA TTGCCCAAGC AGAAGGGGTG CTTTTTCATA CTGATGCGGT ACAGGTAGCC GGAAGATTAC CCCTCGATGT CCAACGGTTG GGGGTGGATT TATTGTCGCT TTCGGCACAC AAAATCTATG GGGTTCAAGG GGCCGGGGCG TTATATGTGC GTCCAGGGGT AGAAATTGCC CCTTTGTTGG CAGGAGGAGG GCAAGAACGA CGGTTACGGT CAGGAACCCA AGCTGTCCCG GCAATCGCAG CCTTTGGGAT CGCGGCTGAA TGGGCGGCCA CAGAAATAGC CACGGAAACC CCTCGGTTAC GCGGACTGCG CGATCGCCTT TTCGATTTGA TGGCCGATTG TCCCTATCTT ATTCCCACGG GGGATAGATT GTATCGCCTT CCCCATCATG TGAGTTTTAT TGTGACTGAC CCTTTTAATC AAAAAGTCTC CGAAAGGATT ACGGGTAAAA CGATTGTTCG TCAGCTTAAT TTAGCCGGAA TTGGGATTAG TGCTGGTTCA GCGTGTCATA GTGGTAAATT GAGTCCCAGT CCGATTTTAT TGGCGATGGG TTATTCTGAA AACGAAGCGT TAGGGGGTAT TCGTTTAACT CTCGGACGGG AAACAACTTT AGAAGATATT GAATGGACGG CTATGGTTCT TAAGCAAGTT TTAGGGCGTT TAATGCCACA ATTGGAATGT GTTGGGTGTT AA
|
Protein sequence | MQIYLDYSAT TPPRSEAITQ VEAILKQQWG NPSSLHNWGQ QAATILETAR WQVANLINAP SADSIIFTSG GTEADNHALL GIARSYSKPQ HLIISSVEHS AISETAQILA QSGWQVTILP VNRQGRVTPL ELKAAIRPNT SLISIIYGQS EIGTIQPIEE LAKIAQAEGV LFHTDAVQVA GRLPLDVQRL GVDLLSLSAH KIYGVQGAGA LYVRPGVEIA PLLAGGGQER RLRSGTQAVP AIAAFGIAAE WAATEIATET PRLRGLRDRL FDLMADCPYL IPTGDRLYRL PHHVSFIVTD PFNQKVSERI TGKTIVRQLN LAGIGISAGS ACHSGKLSPS PILLAMGYSE NEALGGIRLT LGRETTLEDI EWTAMVLKQV LGRLMPQLEC VGC
|
| |