Gene Information |
Locus tag | Cag_1052 |
Symbol | |
ID | 3747033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1423658 |
End bp | 1424986 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637773581 |
Product | hypothetical protein |
Protein accession | YP_379357 |
Protein GI | 78189019 |
COG category | [R] General function prediction only |
COG ID | [COG2081] Predicted flavoproteins |
TIGRFAM ID | [TIGR00275] flavoprotein, HI0933 family |
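The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1423658
END_BP = 1424986
GENE_LENGTH_BP = 1329
PROTEIN_LENGTH_AA = 442


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the interval spans 1329 bp, and 1329 / 3 minus one stop codon gives 442 residues.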
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.843089 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAC GTTGGGTAGT GGTGATTGGT GGGGGAGCGG CTGGTATGGC GGCTGCCGTT TCAGCCGCAG AGCAAGCCCG TTATCTTGGT GTGGATTGCC ATATTACGGT GATTGAAAAA ACGCATCAAG TGGGTTCTAA AATCCGCATT TCAGGTGGTG GAAAATGCAA TGTTACGCAT GTTGGTACAT CGGCTGAGTT GCTTGAAAAA GGGTTTCTCC GTGCAGCGGA GCAGCGCTTT TTACGTTCCG CACTTTACGC ATTTTCCAAT AACGAACTCC GTGCACTCTT GCAGCAGCAG GGGGTGGCTA CTACTGAGCG AGAGGATGGT AAGGTGTTTC CCGTAGCAGG TGAGGCAAGC GTGGTGGCTG AGGCATTCCG TACCTTATTG CAGCGCTTGA AAATCAATTG CGAGCTGCAT GCGCCAGTGC AAGCCATAAA AGTGCATGGG CAGCAATTCC ACCTTATTAC TCTTCACGGC GACATTGTTG CCGATGCGGT TATTGTGGCA ACGGGTGGGG TTTCGTATCG GCATACGGGC ACTACGGGCG ATGGTTTGCG CTTAGCGCGT GCATTGGGGC ACACGGTTGT AGAGCCTTCA GCGGCACTTT CCTCCATTAT GGTGCAGCCG CATTCGTTAG TTGCGCTTGC GGGTGCGGCG CTACGTGGCG TTGCAGCCGT AGCGCGTGCA GGCAAGTTGC GGGCTGAGCG GCAGGGTGAT ATTCTCTTTA CCCATCGAGG CTTTAGTGGT CCAGCAATGC TATCGCTTTC GCGCGATGTT GCCAATATGC AGCGCTCCCA ACGTGAGGCG GTGCACCTTG CGGCTGACCT CTATCCCCAG CAATTGCATG ATGAACTTGA GGCGTTGTTG CTCCAACATA GTAAAAAGCA AGGTGGGCAG TTGGTGCGCA AATTCTTGCA AGTGTCGCCC ATTGGCATGT TGTTGCTCAA GAGCGAAACC ATGCCATATG GCACCATTCC CAATGCCATG GTGCCACTGC TAATGCGTCA GGCGGCGCTT GATGACGAGG TAACCTTTGC TACGTTAAGC CGTGAGCATC GCCATCAATT GGTGGTTACC TTAAAGCAGT TTCAGCTTGG TACGGTTCAT AACGTGTCGT TGGATGCAGG GGAAGTTTCG GCTGGTGGAG TAGCGCTTAG TGAAGTGAAT CCTAAAAGCA TGGAGTCGCG CCTTGTGCCA AATCTTTATT TTTGCGGGGA AGTGTTGGAT TATGTGGGGG AAATTGGAGG CTATAATTTA CAAGCTGCTT TTTCAACGGG ATGGATGGCT GGAAAAAGTG CTGTGAACAA GCTTTTAACA GCTCTTTAA
|
Protein sequence | MKERWVVVIG GGAAGMAAAV SAAEQARYLG VDCHITVIEK THQVGSKIRI SGGGKCNVTH VGTSAELLEK GFLRAAEQRF LRSALYAFSN NELRALLQQQ GVATTEREDG KVFPVAGEAS VVAEAFRTLL QRLKINCELH APVQAIKVHG QQFHLITLHG DIVADAVIVA TGGVSYRHTG TTGDGLRLAR ALGHTVVEPS AALSSIMVQP HSLVALAGAA LRGVAAVARA GKLRAERQGD ILFTHRGFSG PAMLSLSRDV ANMQRSQREA VHLAADLYPQ QLHDELEALL LQHSKKQGGQ LVRKFLQVSP IGMLLLKSET MPYGTIPNAM VPLLMRQAAL DDEVTFATLS REHRHQLVVT LKQFQLGTVH NVSLDAGEVS AGGVALSEVN PKSMESRLVP NLYFCGEVLD YVGEIGGYNL QAAFSTGWMA GKSAVNKLLT AL
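The sequences above are printed in space-separated blocks of ten characters. The gene sequence as shown begins with ATG and ends with TAA, which suggests it is already given in coding orientation even though the gene lies on the minus strand. The sketch below, with hypothetical helper names, shows one way to strip the block spacing, compute a GC fraction, and translate the first three blocks of the gene sequence. It uses the amino acid assignments of translation table 11, which are the same as those of the standard code; it does not handle alternative start codons.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Translation table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    fragment = clean_sequence("ATGAAAGAAC GTTGGGTAGT GGTGATTGGT")
    print(translate(fragment))  # MKERWVVVIG
    print(round(gc_fraction(fragment), 3))
```

The translated fragment matches the first block of the protein sequence in the record. Running the same functions on the full gene sequence would allow the reported protein length and GC content to be checked in the same way.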