Gene Information |
Locus tag | Cag_0447 |
Symbol | |
ID | 3747372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 524280 |
End bp | 525518 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637772980 |
Product | hypothetical protein |
Protein accession | YP_378763 |
Protein GI | 78188425 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
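The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes, which the record itself does not state, that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon. `cds_lengths` is a hypothetical helper name used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for a CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a
    stop codon, which is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(524280, 525518)
print(gene_len, protein_len)  # 1239 412
```

Under these assumptions the result agrees with the Gene Length (1239 bp) and Protein Length (412 aa) fields.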
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.607814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTACC TTTTTGTTCA CCAAAATTTT CCCGGGCAGT TCAAATTTCT TGCCCCAACA TTAGCCGCTA ATAAGAGCAA CAAAGTGGTA GCGCTTTGCA TGAAGCCGCA AGCGCCTACC ATATGGCAAG GCGTTGAGGT TCGTAGCTAT AGCGCCAATC GAGGCACAAC AAAAGGGGTG CATCCGTGGG TGAGCGATTT TGAAACCAAA ACCATTCGGG CGGAAGCCTG CTTTATGGCA GCGCAACAGC TTAAAGCTGA AGGCTTTACG CCCGACGTTA TTATTGCGCA TCCCGGCTGG GGCGAAAGCA TGTTTTTAAA AGAGGTATGG CCTCATGCAA AGCTTGGCAT CTATTGCGAA TTTTACTACC ACCCCGAAGG TGCTGATGTT GGCTTTGATC CTGAATTTCC ACCGAAAAGT GAGAGCGACC GTTGCCGCTT GCGGTTAAAA AACCTCAACA ACATTGTACA CTTTCAAATT GCCGATGCGG GACTTTCACC AACTCATTGG CAGGCAAGTA CTTTTCCCGA ACCATTTCGC TCCCGCATTA CCGTTGCCCA CGATGGCATT GATACCACGC TGCTCTCTTC AAACGTAGCA GTGCGCTTAA CGCTCAATAA TAGCCTGACA CTCACTCGTA AGGATGAAGT TATCACCTTT GTAAATCGCA ACTTAGAGCC ATATCGAGGC TACCACGTTT TTATGCGAGC ACTGCCCGAA CTGCTGCAAC AGCGCCCAAA TGCACGGGTG CTTCTTGTGG GGGGCGACAA GGTCAGTTAC GGCGCTAAGC CTGAGGGAGA AGAAAGCTGG AAGGAGCACT TTATTGCCGA AGTGCGTCCA CGCATCAGCG ATGCCGATTG GGCACGAGTT CACTTTCTTG GAACTATTCC CTACAACATT TTTGTTCAGT TGCTCCAACT CTCCACCGTA CACATTTATC TCACCTATCC CTTTGTGCTT TCATGGAGTT TGCTTGAAGC CATGAGCATT GGGTGCGCCA TTGTTGCCAG CAACACCAAG CCGCTGCTTG AAGCCATTCA CCACAATGAA ACAGGGCAAC TTGTTGATTT TTTTGATGAA AAAGGATTGG TTGAGAACAT TTGCGAGTTG CTTGATAACA CCAATGAACG CGCACGGCTT GGCGCTAATG CCCGACGCTT TGCGCAAGCC ACCTACGACT TACGCACCAT CTGTTTACCG CAACAGCTTG CATGGGTTGA GAGCTTGAGC AAAAAATAG
Protein sequence | MRYLFVHQNF PGQFKFLAPT LAANKSNKVV ALCMKPQAPT IWQGVEVRSY SANRGTTKGV HPWVSDFETK TIRAEACFMA AQQLKAEGFT PDVIIAHPGW GESMFLKEVW PHAKLGIYCE FYYHPEGADV GFDPEFPPKS ESDRCRLRLK NLNNIVHFQI ADAGLSPTHW QASTFPEPFR SRITVAHDGI DTTLLSSNVA VRLTLNNSLT LTRKDEVITF VNRNLEPYRG YHVFMRALPE LLQQRPNARV LLVGGDKVSY GAKPEGEESW KEHFIAEVRP RISDADWARV HFLGTIPYNI FVQLLQLSTV HIYLTYPFVL SWSLLEAMSI GCAIVASNTK PLLEAIHHNE TGQLVDFFDE KGLVENICEL LDNTNERARL GANARRFAQA TYDLRTICLP QQLAWVESLS KK
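The relationship between the two sequence fields can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the tables differ in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the code assumes the sequence contains only A, C, G, and T; an ambiguous base such as N would raise a `KeyError`.

```python
# Codon-to-amino-acid assignments, laid out in TCAG order for the
# first, second, and third base of each codon ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field above.
prefix = "ATGCGCTACC TTTTTGTTCA CCAAAATTTT"
print(translate(prefix))  # MRYLFVHQNF
```

The output matches the first block of the Protein sequence field. Although the gene lies on the minus strand, the Gene sequence field already begins with ATG and ends with TAG, so it appears to be given in coding orientation and no reverse complement is applied here. Passing the full Gene sequence field to `gc_percent` would give a value to compare against the GC content field.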