Gene Information |
Locus tag | Cyan8802_2471 |
Symbol | |
ID | 8391796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 2492829 |
End bp | 2494049 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644980438 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_003138175 |
Protein GI | 257060287 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
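The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced with a single terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2492829
END_BP = 2494049
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (unspliced CDS assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1221
print(expected_protein_length(GENE_LENGTH_BP))  # 406
```

Both results match the reported Gene Length and Protein Length. The strand value does not affect either calculation, because the coordinates are given on the replicon regardless of orientation.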
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0400427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.45906 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAGG GGTTCGAGTT TGATGTGTTT CTAGCGCATA ATAGTGTGGA TAAACCCCAT GTTAGGGAGA TTAGTAACAA ACTAAGGGAA CGAGGGTTAA AACCTTGGCT AGATGAGGAA CAAATCCCTC CTGGGATGTC ATTTCAGGAT GAAATTCAAA AAGCGATTCC CCTGATTAAA TCGGCAGCTA TTATTATTGG TACTCAGGGA TTAGGAAAAT GGCAGATCAT GGAACTGCGA TCGCTTATCA CTAAATTTGT GAATCTAAAA ATTCCTGTTA TTCCTGTTTT GTTGCCAGGG GTTAATAATA TTCCAGGTGA TTTACTATTC CTACAAGAAC TTAATTGGGT TAAGTTTGAA CAGATTGATG ATGCTACGGC TTTTTATCGG CTAGAGTGGG GCATTACTCA GGTTAAGCCG GAGTTACATC CCAAAACTGT ACAATTGACT GCCGAGGAAT GGTTTAACCT TGGCTATAAC AAGGGTGAAT CAGGAGACAA CCAAGGTGCG ATCGCTGACT TTAATCAAGC CATTAAAATC AAATCCGACT TGGCAGAAGC GTACTACAAT CGCGGGTTAG CCAAGTCTAA CTTAGGAGAC TATCAAGGTG CGATCTCTGA CTACAATCAA GCCATTGAAA TCAAACCCGA CTATGCTGCT GCCTACAACA ATCGTGGATT AACTAAGTAT AACTTAGGAG ACAACCAAGG TGCGATCACA GACTACACTC AAGCGATTGA AATCAAACCC GACGATGCTG ATGCCTACTA TAATCGCGGG TTAGCCAAGT ATAACTTAGG AGACAAGCAA GGGGCGATCG CTGACTACAA TCAAGCGATT AAAATCAAAC CCGACTATGC TACTGCCTAC AACAATCGCG GGAATGCTAA GTATAACTTA GGAGACAAGC AAGGGGCGAT CGCTGACTAC AATCAAGCGA TTAAAATCAA ACCCGACTAT ACCCTTGCCT ACATCTGTTG CGGGTTAGCC AAGTCTAACT TAGGAGACAA CCAAGGTGCG ATCACTGACT ACAATCAAGC GATTAAAATC AAACCCGACT ATGCTGATGC CTACATCTGT CGCGGGAATG CCAAGAAAAA CTTAGGAGAC AACCAAGGTG CGATCGCTGA CTACAATCAA GCAGCACAAC TTTACTCGCA GCAAAATAAT ATGGAATGGT ATCTTAAAGC CCTTGAAAAG ATCAAAAAAC TTGAACAATG A
|
Protein sequence | MSEGFEFDVF LAHNSVDKPH VREISNKLRE RGLKPWLDEE QIPPGMSFQD EIQKAIPLIK SAAIIIGTQG LGKWQIMELR SLITKFVNLK IPVIPVLLPG VNNIPGDLLF LQELNWVKFE QIDDATAFYR LEWGITQVKP ELHPKTVQLT AEEWFNLGYN KGESGDNQGA IADFNQAIKI KSDLAEAYYN RGLAKSNLGD YQGAISDYNQ AIEIKPDYAA AYNNRGLTKY NLGDNQGAIT DYTQAIEIKP DDADAYYNRG LAKYNLGDKQ GAIADYNQAI KIKPDYATAY NNRGNAKYNL GDKQGAIADY NQAIKIKPDY TLAYICCGLA KSNLGDNQGA ITDYNQAIKI KPDYADAYIC RGNAKKNLGD NQGAIADYNQ AAQLYSQQNN MEWYLKALEK IKKLEQ
|
| |
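The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, using only the Python standard library, of how the blocks could be joined and a GC fraction computed. The helper names are hypothetical, and the example runs on the first three blocks of the gene sequence only; it does not attempt to reproduce the reported GC content of 43%, which depends on the full sequence and on the database's rounding.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence, copied from the record above.
fragment = clean_sequence("ATGTCTGAGG GGTTCGAGTT TGATGTGTTT")

print(len(fragment))                  # 30
print(fragment.startswith("ATG"))     # True: the CDS begins with a start codon
print(round(gc_fraction(fragment), 3))
```

Applying `clean_sequence` to the full Gene sequence field should yield a string whose length equals the reported Gene Length, which gives a quick way to detect truncated or corrupted copies of the record.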