Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0478 |
Symbol | |
ID | 6374142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 501397 |
End bp | 503367 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642682996 |
Product | hypothetical protein |
Protein accession | YP_001958923 |
Protein GI | 189499453 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.522287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCAA CCATTTTTTT AGCGGCATTT CTCCATCTAT TCATTTTTCA ATCCCTGCCG GTTTTCGCAG ACGACGATCT CGAAGCCCTC TTTGATCAAA GCGATATGCC CGGCGACATC GAGCAGCTGC TTCTTGAACT GCAGGAGCTG AAGCAGAGGA AAATCCCTGT CAATAGCGCG ACGGAAGAGG ACCTTCTGCT TATCCCGTTT CTCTCGAACG ACGATGCCCG CAGGATCATC GAGTACAGGG AGAAGAACGG CCCCCTGACT TCTGTGGGGC AGCTTGCCGG GGTTATCGGC AGTGACCTGG CGCGCAGGAT TTCACTGTTT CTCTCCTTTG AGTCCCCGAG GCTTATAGTT CCTGAGAAAG CGGTTCCCTT TAGCGGAAAC TGGTACGGCA GATACTTCAG TGAAAGCCCC GAGCGGAGCG GGATTCTTTC AGGGAAATAC GGAGGAGAGA GCTACAAGTT GTACAACCGC TTGCAGGTGG TCAACGGGGG GATTTCGGTA AACGGGGTAA TGGAAAATGA TGTCGGAGAG CCTGATATCG ACGACTTTAC CTCGTTGAGT GTCGCATACG ACGGTTCCGG GAGTTTCGAG CGGCTGATAG CCGGTAACTA TACGGTCAAT TTCGGTCAGG GGCTGTTGTT CGGGCAGAGC AGATACCTTT CAAAAGGGGT AGATCCTCTC GGGGTGAAGC TTTCCGGGCG TCGGCTCAAA GCCTACGCTT CAAGTGCGGA AAACGGTTTT ATGCAGGGCG CGGCGACAAC TCTCAATCCG GACCCGTTCA GGCTCACGGC GTTTTATTCC AGCAATCTGA TCGATGCTTC CGTGGAAGAC GGAACAGTCA CCACCATCCG TACATCAGGC TACCATCGAA CTGAGAGTGA AATCGAGCAC AAAGATAACG TGACTGAGCA GGCTGGAGGG GTGAACATCC TCTACACGCT TGATTCCGGG CCGGTCAATG GAACTGTAGG TGGGACATGG GCGCGCTACC GCTATTCGAT GCCCCTTGAC GATATCGAAG GCAGCGGGGA ATGGCTTGAT ATGGGAGGTG TCGAGGCTGA TCTGCTCATA GGGAAGGTCA ATGTTTTCGC GGAAGCTGCT GTGACCGGCA AAGATCCCCG GCTCTCCTGG ATCAGCGGAA TGCGTTTTCC GTTGACCGAT GATATCCGCA CTGTACTTGT GGTCAGAGAT TATCATAACC GGTACTTTTC CCCCTTCGCT GGCGCTTTCG CTGAACGTGC GGATGACGCG TCAAACGAAG AGGGCTATTA TATCGGTCTT GAAGCAAAAA TCCTGAAGAA CCTTCGCCTC GGGGCCTACT ACGATATCTT CAGGTTTCCC GAGCTCAGCA GCCGATACCG ATTGCCATCG ACAGGGGACG AAGCGAAAAT TTTTCTCACC TGGAAACAGT CCCCGGTGTT GACGACGGAA CTGCTGTTGC AGAACCAGTA CAAGGAAGAG GCCAAAAAAC TTGAGGACGG ATCAGGTCGT GAATATTACC AGCCGGTTCC CTTCAGGTCG AACCGCGCAC GCCTTGGCCT TATCGGAAAA GTTTCCAGGT GGCTGACGCT CAAGACAAGG GGAGAGATCA AGTTTGTGGA TGGAGAGTAT CCCGATGGTG ATGACCATTC CGAAGGGTGG CTGATCTATC AGCAGGCGAC GATACGCAAG GATCCTGTCA CCTTCAAGGC CCGCTACACC AGGTTCTTTA CCGATGACTT CGACTCTGCG ATCTATGTCT ATGAAGATGA CCTGCCGCTG GTCTTTACCC TGAAATCCTA CTATGGAGAG GGACAGGCCG CTTTCGCGGT TGTTTCGCTT GATCTTCTCA AGAATTTCAA ACTCTCCGCC CGATACGGCA AAACATGGTA TGACGATCGC GAGGTATACA GCAGCGGCAA CGACAAACGA GAAACCAACG CCCCCGCGTC GTATCATCTC GGTTGTGCGT TACGGTTTTG A
|
Protein sequence | MRATIFLAAF LHLFIFQSLP VFADDDLEAL FDQSDMPGDI EQLLLELQEL KQRKIPVNSA TEEDLLLIPF LSNDDARRII EYREKNGPLT SVGQLAGVIG SDLARRISLF LSFESPRLIV PEKAVPFSGN WYGRYFSESP ERSGILSGKY GGESYKLYNR LQVVNGGISV NGVMENDVGE PDIDDFTSLS VAYDGSGSFE RLIAGNYTVN FGQGLLFGQS RYLSKGVDPL GVKLSGRRLK AYASSAENGF MQGAATTLNP DPFRLTAFYS SNLIDASVED GTVTTIRTSG YHRTESEIEH KDNVTEQAGG VNILYTLDSG PVNGTVGGTW ARYRYSMPLD DIEGSGEWLD MGGVEADLLI GKVNVFAEAA VTGKDPRLSW ISGMRFPLTD DIRTVLVVRD YHNRYFSPFA GAFAERADDA SNEEGYYIGL EAKILKNLRL GAYYDIFRFP ELSSRYRLPS TGDEAKIFLT WKQSPVLTTE LLLQNQYKEE AKKLEDGSGR EYYQPVPFRS NRARLGLIGK VSRWLTLKTR GEIKFVDGEY PDGDDHSEGW LIYQQATIRK DPVTFKARYT RFFTDDFDSA IYVYEDDLPL VFTLKSYYGE GQAAFAVVSL DLLKNFKLSA RYGKTWYDDR EVYSSGNDKR ETNAPASYHL GCALRF
|
| |