Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_4270 |
Symbol | |
ID | 7103816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 4483588 |
End bp | 4484871 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643477250 |
Product | hypothetical protein |
Protein accession | YP_002374349 |
Protein GI | 218248978 |
COG category | [R] General function prediction only |
COG ID | [COG4671] Predicted glycosyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAAAC AACTTAGTTT CAAACAAAAT CGTCCCTCTA TAAATAATAT AATGCCAGCC ACACCCATTA AAATGTTATC CAATCTCAAA AAGGGTCAAA AATTGAGAAT CGCCCTTTAT TCCCACGATA CCATGGGATT AGGACATAAA CGCCGGAACC TATTGATTGC CCAAAGTTTA GCATCTTCCG CGCTCAATGC TGACATTTTA ATGATTAGTG GAATGAGTGA AATGACGCAA GTGGGAACTC ATCCTAACAT TGAATATTTA ACCTTACCTG CACTTTATAA AACTTCTGAG GGAGACTATC AAGCGCGTCG TTTAGCGATG TCCCTCGACG AAATTATTCA TTTGCGATCA CAGGTTATTC GTACAGCAAT TAAGAATTTT CAACCGGATG TTTTTATTGT TGATAATGTT CCGAGGGGAG CAATGGGAGA ACTAAATGAC ACCTTAAAAT ATCTCCGCAA TCAAGGTAAC ACTTTATGTA TTTTAGGGTT ACGCGATATT TTAGATACCC CAGATGTTAT TAATAGAAAT TGGAAAAAAG TTAACAATGA AAAAGCCATT CGCCGTTATT ATAATGCCCT GTGGGTTTAT GGTGATCCAA CGGTTTATAA TTTAGTGAAG GAATATCAGT TTGAACCGGA TATTGCTCGC AAAGTTTATT ATACAGGATA TCTCGATCAA CGAATCAGAC GGAACTATTC TCAAACAGAA CAAAAACAGT CCCTAACTCT TTCATCAGGA CGGTTAGCCT TATGTTTAGT CGGTGGGGGA CAAGATGGAA GTAACTTAGC CGAAACCTTT GCTCAAACGC AATTACCAAC TAATATGCAA GGTGTCATTT TAGCAGGACC CTTGATGCCG CGTCACTTGC GTCAACAGCT AAAAGAATAT ACAGCCAGTC GTCCCAATTT ACAGGTATTA GACTACGTTT CTGAACCCAC TTTTTTACTA GAAAAAGCTG ATATTGTTGT CGCAATGGGA GGCTACAACA CGACTTGCGA GATTTTATCC TTTGAAAAAC CTTCCTTAAT TGTTCCTCGT ATTGAACCAA GGGAAGAACA ATTAATTAGG GCGCAACGGT TACAAGAATT AGGGTTAGTT GATATGTTAC ACCCTAATCA TCTTACCCCC CAAGCATTAA GTCAATGGTT ATCTCAAGAA GCTAAAACCC CAAAAGTCCA TCAAACCCTT GATTTTCAAG GCTTAAACCG GATTCCCCAA TTGGTAGCAG AAATGTTAGA TACAACCTTG TTTCAACATC TAAAAGCCAG TTAA
|
Protein sequence | MTKQLSFKQN RPSINNIMPA TPIKMLSNLK KGQKLRIALY SHDTMGLGHK RRNLLIAQSL ASSALNADIL MISGMSEMTQ VGTHPNIEYL TLPALYKTSE GDYQARRLAM SLDEIIHLRS QVIRTAIKNF QPDVFIVDNV PRGAMGELND TLKYLRNQGN TLCILGLRDI LDTPDVINRN WKKVNNEKAI RRYYNALWVY GDPTVYNLVK EYQFEPDIAR KVYYTGYLDQ RIRRNYSQTE QKQSLTLSSG RLALCLVGGG QDGSNLAETF AQTQLPTNMQ GVILAGPLMP RHLRQQLKEY TASRPNLQVL DYVSEPTFLL EKADIVVAMG GYNTTCEILS FEKPSLIVPR IEPREEQLIR AQRLQELGLV DMLHPNHLTP QALSQWLSQE AKTPKVHQTL DFQGLNRIPQ LVAEMLDTTL FQHLKAS
|
| |