Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_0141 |
Symbol | |
ID | 8389444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 147925 |
End bp | 149031 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644978188 |
Product | biotin synthase |
Protein accession | YP_003135947 |
Protein GI | 257058059 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0502] Biotin synthase and related enzymes |
TIGRFAM ID | [TIGR00433] biotin synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00000566121 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGTTCAAG CCCCCTTATC ACCATCTCTT CTTCAGACTC AATCCATTTC CCAACCGCCT CAAGAAACAG AGGCATTAAA AGCATGGCTT GAGGAATTAA CCCAAAAAAT CATCGAGGGC GATCGCATCA GTAAATCAGA AGCCCTGACC CTGACCAAAA TTGAAGGTCA AGACAATATT CTTGTATTAT GTGAAGCAGC CGATCGCATC AGACAAGCTT GTTGTGGCAA TGTAGTCGAT CTGTGTAGCA TTATCAACAT TAAATCCGGC AACTGTTCAG AAAATTGTCG CTTCTGTTCC CAGTCAGTTT ACCATCCAGG GGAAAATTCC CCCATTTATG GGCTAAAATC CTCAGAGGAA ATTCTCGCTC AAGCCAAAGC GGCTGAAGCG GCTGGGGCAA AACGCTTTTG TCTGGTCAGT CAGGGACGAG GACCGAAATA TCAAGGAGCA AAATCCAAGG AATTTGAGCA AATCTTAGCA ACCGTTCGGC AAATTGCCGC CGAAACCTCT ATTAAACCCT GCTGCGCTCT AGGGGAAGTG ACCCCAGAAC AAGCCCAGGC TTTAAGAGAA GCCGGAGTTA CCCGCTATAA CCACAATTTA GAAGCCTCAG AAGGATTTTA TCCCGAAATC GTCACCAGTC ATAGTTGGCG CGATCGCGTG GAAACCATTA AAAACCTCAA AGCAGCCGGG ATTCAAGCTT GTAGCGGCGG AATCATGGGC ATGGGAGAAA CTTGGGAAGA TCGGGTGGAT TTAGCCCTCG CTTTGCGGGA ATTAGGCGTA GAATCGGTTC CGATTAACCT CCTCAACCCC AGAGAAGGAA CCCCATTAGG AGACTGTCAT CGTCTAGATC CCTTTGAAGC TCTCAAGGCG ATCGCTATTT TTCGCTTGAT TCTCCCTCAA CAAATCCTGC GCTACGCGGG TGGACGGGAA GCGATTATGG GAGACTTACA AAGTCTAGGG CTAAAATCGG GAATTAATGC TATGCTGATT GGACATTATC TAACAACTCT AGGACAACCA CCAGAGAAAG ATCTGGCTAT GGTTGAATCT TTAGGCTTGC AAGGGGGTGA AGCTCCAATT CCTGGTGAAT ATCAAACGCG ATCGTAA
|
Protein sequence | MVQAPLSPSL LQTQSISQPP QETEALKAWL EELTQKIIEG DRISKSEALT LTKIEGQDNI LVLCEAADRI RQACCGNVVD LCSIINIKSG NCSENCRFCS QSVYHPGENS PIYGLKSSEE ILAQAKAAEA AGAKRFCLVS QGRGPKYQGA KSKEFEQILA TVRQIAAETS IKPCCALGEV TPEQAQALRE AGVTRYNHNL EASEGFYPEI VTSHSWRDRV ETIKNLKAAG IQACSGGIMG MGETWEDRVD LALALRELGV ESVPINLLNP REGTPLGDCH RLDPFEALKA IAIFRLILPQ QILRYAGGRE AIMGDLQSLG LKSGINAMLI GHYLTTLGQP PEKDLAMVES LGLQGGEAPI PGEYQTRS
|
| |