Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1895 |
Symbol | |
ID | 3746794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 2406533 |
End bp | 2409610 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637774432 |
Product | hypothetical protein |
Protein accession | YP_380188 |
Protein GI | 78189850 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACTGG GTGAGATGAT GATACAAGGT GTAGCAATGT CGGCATATAG CTGGTCGGGC TATAATGGTA CGGGCTACGA GAGTTTATTG CAGCAGGCGG TGGAGGTTGG GGCTACCTCA GTGCTACTTG GTAGTGTTTC AATTATTGAC CTCAATAATG GAGCGGTTAG CGCGTGGGTG CGTGATGATG GTTTTACCAC CACCGCAAGC ATGGGGGATG TTGAAGCGGC TATTCAACAA GCGCAAGCGC ATGGTTTGCA GGTTTTTTTA AAGCCGCAAA TCCACTCCTA TAATCCAGCA TCTGCTGCTT TTGGTGGTAA TCCATACAAC AATCTTATAA ATCCCGATCC GAGTAATCCG CTTATTATTC CTAATCTCGA TCTCTTTTTT GAGGGCTATA AAGCCTATAT CGTAGAGTGG GCGGAGCTTG CAGAGCGTTA TCAGGTGCCG CTCTTTAGCG TTGGGAATGA AATGGTGGCG GTTACCTCGG CTGAGTTTAC GCCCTATTGG GAGGATATTA TTGCAAGTGT GCGCAATGTG TATCATGGGC AGCTTACTTA TGCAGCAATG ACTGATGTGA AGTGGGATTC GAATGATGAG GTATCGCACA TTGAATTTTG GGATAAGCTT GATTATGTGG GCGTTGATAT GTATCCCGAT TTTGATACCG GTGCAACAAT CCCTACAACG CCAACCGTTG AGCAGCTTAA TGATATTTGG GTAGAGCAAA AGTGGCAAAG CTATTTAAGT GCTATCGCCG AAGCAACGGG TAAGCCGCTT CTTTTTACTG AAACCGGGGT GGCAAGCTTT TTGGGTGGAG CGAATCGTAG TCGTTATACC GATGCGCTGA TTAGCCAAAT GGGCACTGTG CGTGATGATG CAACGCAAAC CAATTGGTTC CAAAGCTTTG CTGAAACGTG GATGGGTGAG AACCAACCTG AGTGGTTTGG TGGCATGTAC TTTTGGAATA ACGACCCTCC ATATAATGCA GGTTTACAAG ATATCACAGG CTATACCTTT TTTGGTAAAC CTGCTGAAGT AGTGGTTAGT AGCCTTTTTG ATGCGGTCAA TAGCCTCGAT TTTGATCAAA CACTTTTTCT TGCCAGCGAT AGTGATGACC GCATTGCTCT CTACAAATAT ATTGCTGAAG CTGATGCAAA TCCGTTGACG CGAGCGCAAA GCTATCATTC CACCGTTATT ATTGAGCTAA ACGGCACTAT TCTTGAAGGT GCTGAAGCGG TAACACCGAC CATCCATTTT TATCTTAATG GCAAAGATTA TGGCGCTGTT ACGCTGAGCA ATGTTGAAAG TGAGTATAGC ATAAATAAAG GAATTGAAGC TGCAAGTAAA GGTGAGGCAT ATTATCCTCA TAGCACGCTT ATCCCATTTC TTTTTGAGAT AGATGAATTA GAGGTTCGCG ACATTCACAT TGTGCGCGAT TCCGTGCAAG TGGAAAACAG CGAAGTGTAT ATCAGCCGTG TAACCATTGT GCCCGATATG GGTGCGGCAA CGGTTAATAC AACGGTTAAT AGTTTACAAA ATGCGTGGCT TGCTTTTGAG GAACCATCCC AAGCATGGGG TGGCGCAACA GGATATCAAT TTCCAAACGG TGCCATTCCT TACGATGTAC CATCCGTTAC CATAGATACT TCACCCTATA AAAAAACGCT TGCCACCATG AGCGGTACGC CTGACAATCC CATTACTGTT AAAGGATACG AAGGCTTCGA CACGGTTTAT TTATTGGGTT CACCTGAGCA ATACACCATT ACTCTTGAGG GCGATATGCT CATGGTAGCC GAAAGTAGTG GCTTAGGGCA AAATAGTCAG CTTGGCGGTG TAGAGCGCTT GCTTTTTGCG GAGGCTGATT ATGCGCTGCT TTTTGGGGGT ATGGGGAATG ATACGCTGTA TGGTGGCGCT GGCAACGACC GCTTTAACGG TGGTGATGGC AATGATGTCG TTTTATTAAG CGGCTATGCT ACCGAATACG AAGTAAGCGA CAATGAAGCG AGTGCAACTT ATACCATTAC CGATAGTGTT GCAGGACGCG ATGGCAGCTA TCAACTTAGT AACATGGAGG CGTTGCAATT TGGAGCATCG CCAATGCAAT GGAATCTTGA AGAGTTTCGG GCGGCGCTTG CAGCCTCGCA ACTTCTCCCG CAACAAGAAG AGCCATTCAA TGTAACAGGC TCAGTTACCT TTTGGAAAAA TGGTGCAGCA ATCAGCAATG TGGCTACAAC GCTCTCGCTG CATTCGGTTA CCAACAATGG CGAAGAGCTG CTCTTTCAAC ATTTGCAACA TCATGCTGAT GGAGGCTATA GTGTTGAAGT GTGGGCAAAT GCGACGGATG CGCTGCATAG CCTCCAGTTC GAGTTCCAAC TACCAACAAA TGCGCAAGCT GCGTGGCATT TTAGTGAAGA GGTGCCACAA GGCTGGCAAA CGGGGGTTAA TAATCAAGGT GCTGATGCGT TGCTTATTGG TGGCATGGGC GCTACCGCAT TGCCGTCGGG ATTGGTGCAG CTTGGCACCT TGAGCTTTGT GGCTCCGACT GATGCTGATC GGCTTGAAAT AGCGCTGACA AGAGGTGAAC TTGGCAAGCA ATGGCTTGTT CCAGCAACCA TTACGCTTGA AAGTAATGTG CTTGCAAGCA ATGGCGGTTA TCAGCATAAT GCACTATGGC AAGGTAGCTA CCATTTAAGC GTGCAGCATG AAAGTACTGA AGAGCCAACC AACATGGTTA CCATGAGCGA TGCGTATGCC GCGTTACAGA TAGCCGCAGG GCATAATCCC AATGAGTCTG AAGCGCCACT GCAATCATGG CAATTCTTGG CTGCTGATAT AAATCGTGAT GGCAAGGTTC GTGCATCCGA TGCGCTTACC ATTTTAAAAA TGGCGCTTAA TTACCACGAT GCTCCAAGTG AAGAGTTGAT TTTTCTGCCC GAATGGGTGG GCAAGAGCGA GATGACGCGC AGTTCAGTTG ATTGGTCAGC GACTGAAATA ATGCTTGATG TTGAAAATTA TCAAATTGTT AACCTTATTG GGGTTATTCA AGGTGATGTT GACGGCAGTT TTAGTTAA
|
Protein sequence | MLLGEMMIQG VAMSAYSWSG YNGTGYESLL QQAVEVGATS VLLGSVSIID LNNGAVSAWV RDDGFTTTAS MGDVEAAIQQ AQAHGLQVFL KPQIHSYNPA SAAFGGNPYN NLINPDPSNP LIIPNLDLFF EGYKAYIVEW AELAERYQVP LFSVGNEMVA VTSAEFTPYW EDIIASVRNV YHGQLTYAAM TDVKWDSNDE VSHIEFWDKL DYVGVDMYPD FDTGATIPTT PTVEQLNDIW VEQKWQSYLS AIAEATGKPL LFTETGVASF LGGANRSRYT DALISQMGTV RDDATQTNWF QSFAETWMGE NQPEWFGGMY FWNNDPPYNA GLQDITGYTF FGKPAEVVVS SLFDAVNSLD FDQTLFLASD SDDRIALYKY IAEADANPLT RAQSYHSTVI IELNGTILEG AEAVTPTIHF YLNGKDYGAV TLSNVESEYS INKGIEAASK GEAYYPHSTL IPFLFEIDEL EVRDIHIVRD SVQVENSEVY ISRVTIVPDM GAATVNTTVN SLQNAWLAFE EPSQAWGGAT GYQFPNGAIP YDVPSVTIDT SPYKKTLATM SGTPDNPITV KGYEGFDTVY LLGSPEQYTI TLEGDMLMVA ESSGLGQNSQ LGGVERLLFA EADYALLFGG MGNDTLYGGA GNDRFNGGDG NDVVLLSGYA TEYEVSDNEA SATYTITDSV AGRDGSYQLS NMEALQFGAS PMQWNLEEFR AALAASQLLP QQEEPFNVTG SVTFWKNGAA ISNVATTLSL HSVTNNGEEL LFQHLQHHAD GGYSVEVWAN ATDALHSLQF EFQLPTNAQA AWHFSEEVPQ GWQTGVNNQG ADALLIGGMG ATALPSGLVQ LGTLSFVAPT DADRLEIALT RGELGKQWLV PATITLESNV LASNGGYQHN ALWQGSYHLS VQHESTEEPT NMVTMSDAYA ALQIAAGHNP NESEAPLQSW QFLAADINRD GKVRASDALT ILKMALNYHD APSEELIFLP EWVGKSEMTR SSVDWSATEI MLDVENYQIV NLIGVIQGDV DGSFS
|
| |