Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3678 |
Symbol | |
ID | 8393020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 3756714 |
End bp | 3758045 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644981603 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003139325 |
Protein GI | 257061437 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.35091 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCGA TCGCCACCGA AATTATTTTT ATTCTTCTGC TGATTATCGC CAATGGGATC TTTTCTGGTT CAGAGATGGC TATCGTCTCC TCTCGTAAAG TCCGTTTAGA ACAACTGGCT AGTCGAGGCA ATCGTCAGGC AAGAACGGTA CTAAATCTCA TCAATGCCCC GAATAACTTC CTCTCTACCG TACAAATTGG GATTACCCTG ATTGGGATTC TCAGTGGTGC AATAGCCGGA GCAACCCTAG CTGAACGATT AACGGCGATC TTCCAAAGGA TTCCTCTATT AAAAGCTTAC AGTCAAGGTA TTAGTGTCGG TATTGTCGTC GGGGTGATTA CCTTTCTTTC CTTAGTGATG GGCGAATTAG TCCCCAAACG CATCGCCCTC AATGCACCTG AAAAAATTGC TTGTGCAGTG GCACAACCGA TGAAACTGCT CTCGCGTTTT GCAGCCCCTA TCGTCAATTT ATTGAGTGCC TCGACGGATT TTTTACTAAA ATTATTGGGG ATTAAAGTTT CCGATGAACC AGCCGTAACA GAAGAGGAAA TTAAGGTACT CATTCGTCAA GGGGCTGATT TGGGGTTATT TGAGGAGTCT GAACACGAAA TGGTAGAACG GGTATTTCGT CTAGGCGATC GCTCCGTTAA ATCCCTGATG ACCCCCCGTA AAGAGATAGT TTGGCTCGAT ATCGAGTCAC CCTTAGCGGA GAATTTGCAA GAGGTTATCG ATAGTGGCTA TTCACGTTTT CCTGTGGGAC GGGGGAGTTT AGATCAATAC ATGGGGGTAG TTCGAGGAAA CAGTCTGTTA GCTGCTTGTC TGTCGAACCA GGAGGTTGAT CTCGAATCTT TTCTACAACA ACCCCTCTAT ATTGCTGAAA ATACTCGCGC ATTAAACGTC CTAGAACAGT TTAAGCAAAC AGGCATCCAT ACAGCCCTGG TAATCGATGA ATACGGCGGA ATTGAGGGCT TAGTGACCCT TGATGATGTG GTAGAGGCGA TTTTAGGTGA ATTACCCTCG GCCGAGGATC TTGAGGCTCC CATGGCGGTT CAACGGGAAG ATGGTTCCTG GTTGTTGGAT GGGTTACTGG CGATCGATGA TTTTAAAGAG CTTTTCTCAG ATCTGCCACT TCCGGAAATT TCCTCTCAAC AGTACCATAC CCTCGGCGGT TTCATGATGT ATTCTCTCAA GCGCATCCCC CAAGCAAGTG AGTATTTTGA GTGGGGAAGG TTACGCTTGG AAGTGGTAGA TATGGATGGA ACACGAGTCG ATAAGGTATT AGTGACGGTT CTCGACAATG CTCCCGAAGA TGAACCGATT AACGAGGAGT AG
|
Protein sequence | MSAIATEIIF ILLLIIANGI FSGSEMAIVS SRKVRLEQLA SRGNRQARTV LNLINAPNNF LSTVQIGITL IGILSGAIAG ATLAERLTAI FQRIPLLKAY SQGISVGIVV GVITFLSLVM GELVPKRIAL NAPEKIACAV AQPMKLLSRF AAPIVNLLSA STDFLLKLLG IKVSDEPAVT EEEIKVLIRQ GADLGLFEES EHEMVERVFR LGDRSVKSLM TPRKEIVWLD IESPLAENLQ EVIDSGYSRF PVGRGSLDQY MGVVRGNSLL AACLSNQEVD LESFLQQPLY IAENTRALNV LEQFKQTGIH TALVIDEYGG IEGLVTLDDV VEAILGELPS AEDLEAPMAV QREDGSWLLD GLLAIDDFKE LFSDLPLPEI SSQQYHTLGG FMMYSLKRIP QASEYFEWGR LRLEVVDMDG TRVDKVLVTV LDNAPEDEPI NEE
|
| |