Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0474 |
Symbol | |
ID | 4568518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 522244 |
End bp | 523782 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639765073 |
Product | hypothetical protein |
Protein accession | YP_910955 |
Protein GI | 119356311 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATCAC TGCTTCTTCA GATAAAAGAG TCCTATCCGA TGGTGTATGA GGCGTTCAGC GCGTTGCCTG ACGGTGAGCG GCATCTGCGG GGTATTATGG AGCTTGACCG ATACTGGGAG CATCTTCATC AGCCGGTTCC CGAAGTCGTT AAGCCCGGAT CGGAACTTCC GGAGGGTGTT GAGGTGAATG GTGATTTCGA TCTGATCTAT GCAGGTGGAA CGCTCAGCCT TCTGCATGCT GCCGTGATGG CCAGGCAGTA TGACCGCAAG GTGCTGGTTT TTGATCGTCA TACTCCGGCT CAATCGACTC GCGACTGGAA TATATCCCGA GGAGAGCTGT TGAAACTTGC TGATACCGGG GTATTTTCCC TTTCAGAGCT TGAATCGGTT ATTTTACGAG CCTATAAAAC TGGCTGGGTC GAGTTTTACA AACCTGACGG CAGCCAGAAA CGTCTCTATA TCCGGAATGT GCTTGATTGT GCCGTTGATG CAGATCTTCT GCTGTCGATG GCAAGGGAGG TTGTTCTTTC AATGCCTGAA AACAGGGTGC TTTCGCAGAC ATCCTTTACA GCATGTTATC GCTTTGCCGA CCATATTGTT GTTGAGGTTA CCGATAGCGA GGGGAAATCG TACCATTACC GGGGAAAGGT TCTTGTTGAC ATGATGGGTG TTCGTTCTCC TGTGGCAATG CAGCTGAACG AAGGGGCTCC GCAAACCCAT GTGTGTCCGA CGGTGGGTAC GATAGCAAGC GGGTTTATGG ACGCCGATTT CGACACAGGG GAGATTCTTG CAAGTATTGC GCCTGCCGAT ATCGCTTCAG GAACGGGAAA GCAGTTTATC TGGGAGGGGT TTCCGGCTAA AGGCAGCGAG TATATCACCT ATCTGTTTTT TTACGATGAG GTCGATTCAC AAAATGACAA GTCCCTGCTG GGACTGTTTG AAACCTATTT CAGAACGCTT CCGGAATACA AGAAAATCGG ACCTGATTTC AGCATTCATC GTCCGGTATA TGGTATTATC CCGGCATATT TTCATGACGG GTTCAGCCGA ACAAGGGAGA TTGCCGATGA CCGGATCATT TTATTCGGTG ATGCAGCATC TCTTGGCTCT CCCCTGACCT TCTGCGGTTT CGGCTCTCTG GTGCGCAATC TTCACCATCT TACGGCGGAC CTTGATCTGG CACTCGACAG CAATGCGCTT TCAAAAAAGG ACCTTGAAAA AATCAGTGCT TATGAGCCCA ATGTCGCATC CATGGCCAAT CTCATGAAGT ACATGTGTTT CAATGCCGAG ACTGATGAGC CGAATTTCGT CAACGACCTC ATGAATGAGG TGATGGTGGT GCTCGATGAG CTTCCCGAAC GCTATCGTCA GGCTATGTTT CGGGATGAAA TGAAGATTGA GGAGCTTGTT GTGGTTATGT TGCGTGTCGC ATGGCGATAC CCAAAGGTTC TTAAGGCAAC CTGGGATAAG CTTGGCGTTG CCGGTTCGAC GGGTTTTTTG AAAAATCTCG TCGGCTGGGT CTTCTCTTCA GCCAGATAA
|
Protein sequence | MSSLLLQIKE SYPMVYEAFS ALPDGERHLR GIMELDRYWE HLHQPVPEVV KPGSELPEGV EVNGDFDLIY AGGTLSLLHA AVMARQYDRK VLVFDRHTPA QSTRDWNISR GELLKLADTG VFSLSELESV ILRAYKTGWV EFYKPDGSQK RLYIRNVLDC AVDADLLLSM AREVVLSMPE NRVLSQTSFT ACYRFADHIV VEVTDSEGKS YHYRGKVLVD MMGVRSPVAM QLNEGAPQTH VCPTVGTIAS GFMDADFDTG EILASIAPAD IASGTGKQFI WEGFPAKGSE YITYLFFYDE VDSQNDKSLL GLFETYFRTL PEYKKIGPDF SIHRPVYGII PAYFHDGFSR TREIADDRII LFGDAASLGS PLTFCGFGSL VRNLHHLTAD LDLALDSNAL SKKDLEKISA YEPNVASMAN LMKYMCFNAE TDEPNFVNDL MNEVMVVLDE LPERYRQAMF RDEMKIEELV VVMLRVAWRY PKVLKATWDK LGVAGSTGFL KNLVGWVFSS AR
|
| |