Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0445 |
Symbol | |
ID | 3748151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 522293 |
End bp | 523588 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637772978 |
Product | hypothetical protein |
Protein accession | YP_378761 |
Protein GI | 78188423 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000565981 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATTT ACACCAGTCC TGAGACATCG CCGTCCCCCA TTGCAATACT CTGTAGTGAG TACGTGCAAA GTGTTGAAGC AATGGCTGAG AGCCTCCCGT TCATTATGAG TACGCTCATT GAGGCCGAAG ACACTTTTGA TAAGAAGCTT GATGCCTTTA TTGATTTCCA TGCCATAGAC GTCGAGAAGT TGGAAGATGG ACGACGCTAT GGGCTCAAGC TTGAGGACAA GTCAGCGCAC GACAGACTGC ATCGCCGAAT ACGAGTATTT CGAGAGGCAC TTGGGGTTAC TCCACGGAGC TTTCTTGTTG CGTTGGTCAG TGCATATGAT GCCTTTCTTG GCCGTCTCAT TAGGAGTCTA TTCTATGCAC GGCCAGAACT TCTGAACTCA TCTGAGCGCG TATTAACATT TGCGCAGCTT CAGGATCTCC AGACTTTGGA TGCCGCCCGC GAGTATCTTG TCGAAAAGGA GGTTGAATCC GTCTTGCGTA AGAGTCATTC CCAGCAATTT GAATGGCTTG AGAAGACCTT CTCAGTACCT CTACGGAAAG GATTGGAGTG CTGGCCTCGC TTCATCGAGC TAACTGAGAG ACGGAATCTC TTTGTTCATG CGGACGGTAT AGTTAGTTCA CAGTACCTTA ACGTCTGTGG TGAGCATGGC GTTACACATA GTGAAGTTCT TACATCTGGT ACACGTCTGC ATGTCGAGCG ATCGTATTTT CAGTTATCGG CTCATTGCCT TATGGAAATT GGCATTAAAC TTGCGCATGT ACTGTGGAGG AAGCTTGTTC CCACAGATCG CGAAAAAGCC GATGAAAATC TTATAGAGAT TGTTTACGAT TTGCTTATCA AACAAAAATA CCGGCTCGCA GCAGACCTTG CTTCCTTTGG AACCAACACG ATCAAGTCAC ACGGCTCCGA CCAGACCAGA CGCATACTTG TTGTCAATCT CGCGATTGCG CATAAATTTG GTGGTGATGC AGAAAAATGC ACCGCCGTTC TCGACGCTGA AGATTGGAGT GCAACTGCCG ATGACTTCAG GTTGGCAATC GCAGCTCTTC GCGACAACTT CGACGAAGCA GCAAAGCTTA TGAAGTCGAT AGGCAAAGAC AATCGTCTTG GGATGTTTGA GTACCGTGAA TGGCCGGTTT TTCGCGACTT CAGGAAGAGT TATCAATTCG CGGCTGCATA CAAGGAAGTG TTCGGAGAAG AATTCGTACT CAAAGCTCAG GAGCCAAAGG CATCAGAATC ACCCCAGACC ACGGCCAAAG GAGAGAATGT ACTCGACGAA GAATAG
|
Protein sequence | MSIYTSPETS PSPIAILCSE YVQSVEAMAE SLPFIMSTLI EAEDTFDKKL DAFIDFHAID VEKLEDGRRY GLKLEDKSAH DRLHRRIRVF REALGVTPRS FLVALVSAYD AFLGRLIRSL FYARPELLNS SERVLTFAQL QDLQTLDAAR EYLVEKEVES VLRKSHSQQF EWLEKTFSVP LRKGLECWPR FIELTERRNL FVHADGIVSS QYLNVCGEHG VTHSEVLTSG TRLHVERSYF QLSAHCLMEI GIKLAHVLWR KLVPTDREKA DENLIEIVYD LLIKQKYRLA ADLASFGTNT IKSHGSDQTR RILVVNLAIA HKFGGDAEKC TAVLDAEDWS ATADDFRLAI AALRDNFDEA AKLMKSIGKD NRLGMFEYRE WPVFRDFRKS YQFAAAYKEV FGEEFVLKAQ EPKASESPQT TAKGENVLDE E
|
| |