Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1414 |
Symbol | |
ID | 3747173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1882972 |
End bp | 1884003 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637773950 |
Product | hypothetical protein |
Protein accession | YP_379715 |
Protein GI | 78189377 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR00661] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTC TTTTTGGTGT TCAAGGAACC GGTAACGGTC ATATTAGTCG CAGTCGCGAG CTTGTTCGTG CGCTTAAAGA AGCTGGGCAC GAGCTTGAAG TAATTATTAG CGGACGCAAA GAAGAGGAGC TTAAAGAGAT TGAGATTTTT AAGCCCTATC GCGTTCTAAA AGGCATGACG TTGGTAACGC AAAAAGGGCG CTTGAACTAT GTTGACACCA TGGTGCAACT CGATTTTGTG CGCCTTGTTG CCGACATTGT AACGCTTGAT ACCGAAGGGG TTGATCTTAT TGTAACCGAC TTTGAGCCAA TCACCTCACT AACGGCAAAG TTGAAAAATA TTCCCTCAGT AGGCTTTGGG CATCAATACG CCTTCCGTTA CGATATTCCC GTAGCGCCAG GCTCTTTTTT TGAGAAATAT GCTTTGTTGA ACTTTGCTCC AGCCCACTAC AACGCTGGTT TGCATTGGCA CCATTTCTCC CAACCCATTT TTCCCCCAGT TATTCCCGAA ACCCTTTACG CAAAGCATCA TGTTGCCGTT ATTAGTAACA AAGTGCTGGT TTACTTGCCT TTTGAAGAGG TGGAGGATAT CACCACCTTT TTAACGCCCT TTACCGATTT TGAATTTTTT ATTTATGGTA AAGTGCAAGA GGGGAGCGAC CATGAGCATT TGCACTACCG CACCTACTCG CGCGAAGGTT TTCTTGCGGA TTTAATGGAA TGCACGGGCG TGGTATGTAA TGCGGGCTTT GAGCTACCGG GTGAAGCGTT GCACCTTGGC AAAAAAATGC TGTTGCGTCC GCTTGACGGG CAAATTGAGC AGCAATCAAA CGCGCTGGGA ATGGTGGAAC TTGGCTACGG CATGGCAATG GAGAGCCTTG ACCCCACAAT TTTAGCCGAT TGGTTGCAGC AACCTTGTCG TGAACCGTTA CGCTACGCAC GCACCGTCAA CTACATTGCC GAATGGATAA GTTACCGCCA TTGGGATGAG TTGGGGAAAT ACACGGCTAA GGCGTGGGTA GATCACGCAT AA
|
Protein sequence | MKILFGVQGT GNGHISRSRE LVRALKEAGH ELEVIISGRK EEELKEIEIF KPYRVLKGMT LVTQKGRLNY VDTMVQLDFV RLVADIVTLD TEGVDLIVTD FEPITSLTAK LKNIPSVGFG HQYAFRYDIP VAPGSFFEKY ALLNFAPAHY NAGLHWHHFS QPIFPPVIPE TLYAKHHVAV ISNKVLVYLP FEEVEDITTF LTPFTDFEFF IYGKVQEGSD HEHLHYRTYS REGFLADLME CTGVVCNAGF ELPGEALHLG KKMLLRPLDG QIEQQSNALG MVELGYGMAM ESLDPTILAD WLQQPCREPL RYARTVNYIA EWISYRHWDE LGKYTAKAWV DHA
|
| |