Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1976 |
Symbol | |
ID | 6375668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2121033 |
End bp | 2122376 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642684467 |
Product | hypothetical protein |
Protein accession | YP_001960368 |
Protein GI | 189500898 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.116034 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTACGCA CTGCCACGAG CATTATATTG CTTCTCCTGT TGTTTGCGGC GCACCCTGCG GGCGCGGCAG AACAGCAGGA CTTGATCGAC CGTTTCGACA TAACAGCCGG AACAGCAATT TCCTTGGATT CCGAGCCTCC GTTCTGGATG TGGGCGAACC GGGACGGCAT TATTCCTGAC GACACCGGAA CTACATCGTT CACCCGCCTG CAGCTTGGAA AAGAAGCCGA CGACTACAAG CGGTTCGACT GGACGTACGG TCTTGATGTG ACGGCAAGGA GTTCGGGCGA TTCCGATGTG CTGTTTACCG ATGCGTATGT CGGGCTCAAG TTCAGCGATC TCCATCTGAC ACTCGGCCGC AAAAGCGAGT TTTTCGGGCT TGCCGACAGC CTCCTCACCG TCGGTCCGGA AGCTTACAGC CGTAACGCGC CGCCCATCCC GAAAATCGCC ATATCCACCA ACGGGTTTGT CGATGTCGCC GACTGGCTTG GAGTCAATGC CTACTTCGCG CACGGATGGT TGGGCAGTGA GCATTATGTG CCTGATGCCT ACCTGCATCA GAAGTATCTC TACCTGAAAC TTGGCAGTAC GGTTCCTGAC GAGGGCGTCA ATTTTCTTGC CGGCATCCAC CACCTCGCAG AGTGGGGTGG CGCGGGACAG CCATCCAGGT TCAAGGATTT TCTCCGCATT CTTGTCGGAA AGAGCGGAGA CGAGAGAGCA ACGAAGAGCG ATCAGAAAAA TGCGCTCGGC AACCATCTCG GCTCAATCGA ATACGCACTG CAGTTCAAAG GGTATTCCCG CGACTGGTAC CTCTATGTTC AGACGCTGTT CGAAGACGGC AGCGGCCTGC GCTTCTGGTA TCCTGCTGAT TACCTCGCCG GCCTTTCCCT CATCAGCAAG GAACCCTCGG ACCACTTCAG ACGATTCAAT GTCGAATACA TTGATACCCG GAGCGGCGGC AAAAACCCTG CCGAACCCGA TAACTACTTT ACCAACGGCA CTTACGGCGG CTGGGTGTAT GAGGGGTGGG GAATCGGCCA CCCGTTCATC AGATTTATCT CCCTCGCACC GCTCAACCGT ATTACCGGTG CCAACGCATC TCTGATGCTT CGATACGGAG ATATGCTCAA TCCGCTTCTG CGTGTGGCCT GGGTGCGGAA CTCAGGCAGT TTCAGCGCCC CCCTTGAAGG GGAGGAACAG ACCTATCTCG TTTCCCTTGA CGTTACCAAT ATGATGTACC TCAGGGATGG CTGGTCGCTT ACCCAGCAGC TGAGCATCGA TACCGGAAGC GGGAGTGAGC TGAATCCGGG TCTGCTGCTG ACGGTGACAA AATCGCTGCT CTGA
|
Protein sequence | MLRTATSIIL LLLLFAAHPA GAAEQQDLID RFDITAGTAI SLDSEPPFWM WANRDGIIPD DTGTTSFTRL QLGKEADDYK RFDWTYGLDV TARSSGDSDV LFTDAYVGLK FSDLHLTLGR KSEFFGLADS LLTVGPEAYS RNAPPIPKIA ISTNGFVDVA DWLGVNAYFA HGWLGSEHYV PDAYLHQKYL YLKLGSTVPD EGVNFLAGIH HLAEWGGAGQ PSRFKDFLRI LVGKSGDERA TKSDQKNALG NHLGSIEYAL QFKGYSRDWY LYVQTLFEDG SGLRFWYPAD YLAGLSLISK EPSDHFRRFN VEYIDTRSGG KNPAEPDNYF TNGTYGGWVY EGWGIGHPFI RFISLAPLNR ITGANASLML RYGDMLNPLL RVAWVRNSGS FSAPLEGEEQ TYLVSLDVTN MMYLRDGWSL TQQLSIDTGS GSELNPGLLL TVTKSLL
|
| |