Gene Information |
Locus tag | Cag_1094 |
Symbol | |
ID | 3747961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1480443 |
End bp | 1481729 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637773625 |
Product | hypothetical protein |
Protein accession | YP_379399 |
Protein GI | 78189061 |
COG category | [S] Function unknown |
COG ID | [COG1322] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.437038 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTCGAAA CGCTCTCTTT TTTCCTGCTC CTTGCTCTTT TTGGGCTTCT CATATTTCTT GCTATTCGTT TGCTTAGCGC CGCTCCTAAG CAGGCTGAGT TGGCGCGTTT GCAGGCTCTT GAAGCCGATG CTCTTCGCAA TATGGAGCGC TTAACTGCCC TTGCTGAGGA GCATGAAGCG TTAAAAATTC GTTATGCACG TCTTGAGGTG GCTTACCAAA ATGAGCAACA ATCAGTGGCT GAAAAAGAGG CGTTGATGCT TGCTTCTGAA GAGCGGTTAA AAAAGGAGTT TGAGTTGCTT TCGCTGCGCA TTTTGGAAGA GCGTGGAAAA GCGCTTGGGG CTGAGCAGCG TGAACGGCTT GATACTCTTT TGCTGCCGTT ACGCCAGCAG CTTGAAGCCT ATCGCCAACG CATTGAAGAG GTGCATCATG CGGACACGTT GCTTTCGGGG CAGCTTATTG AAGAGGTGCG GCAGTTGCAG GCATTAAGTA GTCGCGTGAG TAACGATGCC CAACAGCTTG CCCATGCCAT TAAAGGTGAT TCAAAAGTGC AGGGTAATTG GGGGGAAATA ATTATTGAGC GAATGTTTGA AGCTTCGGGG CTTGAAAAAG GGCGCGAATA TCTTGCGCAA GAGAGTTTTC GTGATAGCGA TGGTGCCCTT AAGCGTCCCG ATTTTATGGT GCTTTTGCCC GATAACAAAG CTATTATTGT GGATTCAAAA GTCTCTCTTA CTGCTTTTGA GCGTTATAGT GCGCTAAGCG ATCCCGATGA GCAGCAAATT GCGTTGCGTG AGCATCTGCA ATCGGTGCGG CGCCACATTA CCGAGTTGCA AGCAAAAAAC TATCATGAAT TGGGAGGCAA CCGCACGCTT GATTTTGTGT TGCTCTGCAT TCCCATAGAA GCAGCATGGC AAGCGGCGAT GCAAGCCGAT CCCGCATTAC TTTATACGCT TGCAGGGCGT AACGTGGTGG TTTGTAGCCC TACAACGCTG ATGATGACCC TCAAGCTTAT TGCTCAATTG TGGCGACGTG AGCACGAAAA CCGCAATGCT GAACTTATTG CCGAAAAGGC AGGGCGCATT TACGATCAAG TGGCTTTGCT TGCTCATAGT ATGTTGGAGG CACAAAAAAA GTTAAGCAAT GTTAATGATT CGTTTGAACA GGTGTTAAAG CAGCTTAAAA CAGGGCGTGG AAATTTAATT GGGCGCGTGG AGGAGATTCG TAAGCTTGGG GCTAAAGTGA ATCGCCAAAT GCCGCTTGAT GTTACAGCAG AGGCGTTAGA GGAGTAA
Protein sequence | MLETLSFFLL LALFGLLIFL AIRLLSAAPK QAELARLQAL EADALRNMER LTALAEEHEA LKIRYARLEV AYQNEQQSVA EKEALMLASE ERLKKEFELL SLRILEERGK ALGAEQRERL DTLLLPLRQQ LEAYRQRIEE VHHADTLLSG QLIEEVRQLQ ALSSRVSNDA QQLAHAIKGD SKVQGNWGEI IIERMFEASG LEKGREYLAQ ESFRDSDGAL KRPDFMVLLP DNKAIIVDSK VSLTAFERYS ALSDPDEQQI ALREHLQSVR RHITELQAKN YHELGGNRTL DFVLLCIPIE AAWQAAMQAD PALLYTLAGR NVVVCSPTTL MMTLKLIAQL WRREHENRNA ELIAEKAGRI YDQVALLAHS MLEAQKKLSN VNDSFEQVLK QLKTGRGNLI GRVEEIRKLG AKVNRQMPLD VTAEALEE
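The length fields in this record can be cross-checked against its coordinates. The sketch below is only an illustration and not part of the database record: it assumes Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the record (Start bp, End bp).
length = gene_length_bp(1480443, 1481729)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```

Both results agree with the Gene Length (1287 bp) and Protein Length (428 aa) fields listed in the record.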