Gene Information |
Locus tag | Cphamn1_1679 |
Symbol | |
ID | 6375365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 1820175 |
End bp | 1821230 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642684173 |
Product | hypothetical protein |
Protein accession | YP_001960079 |
Protein GI | 189500609 |
COG category | [R] General function prediction only |
COG ID | [COG1106] Predicted ATPases |
TIGRFAM ID | |
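The length fields in this record can be cross-checked from the coordinates. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are hypothetical and not part of the database record.

```python
# Values copied from the Gene Information record.
start_bp = 1820175
end_bp = 1821230

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length_bp % 3 == 0
codons = gene_length_bp // 3

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result agrees with the listed Gene Length (1056 bp) and Protein Length (351 aa).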
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000489043 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGAGC GTCTTGAACT GAGAAACCTC ACGGTCTTCA CCGGCCTAAC ACTGGAGCTC TCGCCCAAGA TCAACGTGAT CATCGGCGAG AACGGCACCG GCAAAACCCA TCTGCTCAAG GCGGCCTATG GGCTTTGCGC TGGTGCACCC CTCTTTAAGA ACAAGCCCGA GACCAGCGAC GATGAGCTTG AAGCGGCGCT AACCGCCAAG CTGCTCCGGC TCTTCATGCC GCTGGACGAC AAGCTCGGCA AGATGCATCG CCAGGGCGCG ACCGACCAGG CCTATCTGTC GGCCCGGTTC GCGGGGGGGC AGAAGATTGC CGCGACCTTC TTCAACAACT CGAAGGCGCT GGCAGTACAG GATCGCACCA ACTACGAGCA GTACCAGGCC GAGGCGGTAT TCATCCCGAC CAAGGAAGTC CTCTCGTTCA TGAAGGGGTT CAACAGCCTG TACGAGAAAT ACGGGCTCTC CTTTGACCAG ACCTATCAGG ACATCTGCCT GCTGCTGGAT CTCCCCGAGG TCCGTTCGGA AACCCTGCAC GAAAAATCCA AATGGGCCAT GTCGGAGATC GAAGGCATCT GCGGCGGCCG TTTCGTTTTC TATGGTGGCG GCAAGGTCAC CTTCAAAACG GAGAATGCCG AATACTCCGC CAACTCCATG GCCGAAGGCT TCCGCAAGGC GGGGATACTC TCCCGCCTGC TGGAGACCGG CGCGATCCAG CCGGGTGTCA GCGGCCCGCT GTTCTGGGAT GAGCCCGAAT CCAACCTGAA CCCGAAGCTG ATGAAGCTGC TCGTGCAGAT CCTGCTGGAG CTGTCACGCA ACGGCCAGCA GATCATTCTG GCGACCCACG ATTATTTTCT ACTCAAATGG CTCGATCTTG AAATGAACAA GAAAGACCAC GTTCTCTATC ACGCACTCTA CCGTGGGGCG AAAGGTGAGA TATGCGTCGA AAGTGCTGAA GACTATCGCT CCATTTCCCC GAATGCTATC GCGGACACCT TCAACGATCT GACCAAAGAG CAGGTGAACA AGAAAATGGG AGGGTTGGGG AAGTGA
|
Protein sequence | MIERLELRNL TVFTGLTLEL SPKINVIIGE NGTGKTHLLK AAYGLCAGAP LFKNKPETSD DELEAALTAK LLRLFMPLDD KLGKMHRQGA TDQAYLSARF AGGQKIAATF FNNSKALAVQ DRTNYEQYQA EAVFIPTKEV LSFMKGFNSL YEKYGLSFDQ TYQDICLLLD LPEVRSETLH EKSKWAMSEI EGICGGRFVF YGGGKVTFKT ENAEYSANSM AEGFRKAGIL SRLLETGAIQ PGVSGPLFWD EPESNLNPKL MKLLVQILLE LSRNGQQIIL ATHDYFLLKW LDLEMNKKDH VLYHALYRGA KGEICVESAE DYRSISPNAI ADTFNDLTKE QVNKKMGGLG K
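The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon assignments of translation table 11, which match the standard genetic code for sense codons, and it ignores alternative start codons because this gene begins with ATG. The function name `translate` and the helper constants are hypothetical, chosen only for this example.

```python
from itertools import product

# Codon table built in TCAG order. Translation table 11 uses the same
# amino acid assignments as the standard code (assumption stated above).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGATCGAGC GTCTTGAACT GAGAAACCTC"
print(translate(first_blocks))  # MIERLELRNL

# Final six bases of the gene sequence (in frame): a lysine codon, then TGA.
print(translate("AAGTGA"))  # K
```

The translated prefix matches the first ten residues of the listed protein sequence, and the final codons give the terminal K followed by a stop. Passing the full gene sequence to the same function is the natural way to check the whole record.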