Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_2035 |
Symbol | |
ID | 3746082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 2265697 |
End bp | 2266887 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637770066 |
Product | hypothetical protein |
Protein accession | YP_375920 |
Protein GI | 78187877 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000607972 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.528651 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCCG GCGGCAAGGA GTATGTTTTT CACAGCGTGG AAACCTTTAG TACATTAAGA AAACAAGAGA CATTGATTTT TTTCAAGGCA CCCAACAATC ACGCACCCAT GACACGACCG GGCCGACTGC TCATCCTCAC CGCAAGCATC TTGCTGCTTC AGGATGCCCC GCTTCCGGGC CTGCACCATA ATACGGCTCA TGCGGAGAAA AAGAAAATCA TTCTCCGCCA CGCCGACACG ATCGAAGGCG GCGAGGATGC CGGCGGCAGC TTCCGCGCGG CCATCGGCAA CGTGTTGTTC GAGAAGGACA ACCTGACATT GAGTTGCGAC CGCACCACCG ACTATGAGGG ACAGGACCGC ATTGTGATGA ACGGCCATGT CCGCATCTCC GATGGCGCCA AAGAGATCTT CGGCGACGAT GGGGTCTTCC ACCCATCCAC CGACGTCTGC GAACTCCATG GCAACGTCCG CGGCCGGATG CTGGACAACT CCATGATGGT CCGGTCCAGA AAAGCAGTCT TCAACAACCA GGAAAACCGC CTCTGGTTCT ACGACAACGC CATCGGGTGG CGTTACAACG AACAGGTCAG CGGAGACATC CTGCGGATCC AATTCAAGCC TGTGGATGAG AATGCACCGT CGAAAGATGG TGGCGGCAAA CAGAAAATCG ACGAAATGCA GGTGCACGGC CACGCATTCT ACGCCACACG CGACACGCTG ACGGCCGACC CGGAAGGCTA TGACCAGCTC AGCGGGCGCC ATATCGTGGT GCTCATCGGA GATGACTCGA AAGTAAAAGG GGTCACGGTG ACCAAAGAAG CCGAAAGCCT AGTCCATATT TATGACAACG ACGGCATGGA GAAGCCGGAC GGCAGCGGGA AAGGCAACAG CAGTGACAAG GGCAAGGGCT CGGACGATGG CGCGAAAAAG CTGAGCGGCA TCAACTACTC GAGCGGCGAC AGAATCAAGA TGATCTTCAA GGAAGGCACT CTGAAGAAAA TGAATGTCAC CGGAAATGCC GAGGGAACTG AATACCCTCC CGAGCTGCGC GGATCGGTCA ACCTCCCGAA ATTCAAATGG CGGGAAGACG AGCGACCATT CGGAAAACAG GCCGCCGGGG ATAGAACGGC CGCAGGCATC AAGACCGTGG ACAAGGGTGC CGAGGCGCCT GAAAGAGAAA GCCTTCCGTA A
|
Protein sequence | MAAGGKEYVF HSVETFSTLR KQETLIFFKA PNNHAPMTRP GRLLILTASI LLLQDAPLPG LHHNTAHAEK KKIILRHADT IEGGEDAGGS FRAAIGNVLF EKDNLTLSCD RTTDYEGQDR IVMNGHVRIS DGAKEIFGDD GVFHPSTDVC ELHGNVRGRM LDNSMMVRSR KAVFNNQENR LWFYDNAIGW RYNEQVSGDI LRIQFKPVDE NAPSKDGGGK QKIDEMQVHG HAFYATRDTL TADPEGYDQL SGRHIVVLIG DDSKVKGVTV TKEAESLVHI YDNDGMEKPD GSGKGNSSDK GKGSDDGAKK LSGINYSSGD RIKMIFKEGT LKKMNVTGNA EGTEYPPELR GSVNLPKFKW REDERPFGKQ AAGDRTAAGI KTVDKGAEAP ERESLP
|
| |