Gene Information |
Locus tag | Cag_1486 |
Symbol | |
ID | 3747981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1957117 |
End bp | 1958538 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637774025 |
Product | toluene transport protein, putative |
Protein accession | YP_379785 |
Protein GI | 78189447 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2067] Long-chain fatty acid transport protein |
TIGRFAM ID | |
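The coordinate, gene length, and protein length rows above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the final codon is a stop codon that adds no amino acid. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1957117
end_bp = 1958538
protein_length_aa = 473

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is made of whole codons; the last codon is assumed to be the stop codon,
# which contributes no amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1422
print(remainder)                # 0
print(expected_protein_length)  # 473
```

Under these assumptions, 1422 bp gives 474 codons, and dropping the stop codon gives the 473 aa listed in the table.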
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000112602 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CAACCTCACG TTTTGCAATG GCTTGTTGTG CCTTGCTTTG TGCTCCTTAT TCCGCTTTTG CCACCAATGG CATGAATCTT GAAGGGTATG GAGCTATTTC CCATGCGCTT GGTGGTACGG GTTCGGCTTA CAATACGGGT AACTCGGGGG TAAGCAATAA CCCTGCCACG TTAGCATTGC GTAAAAGCAA GAGCACGCAA TTAGGCTTTG GGTTGCGTGG CTTGCATCCC GATGTGTCGT TACAGGCTAA TGGTATATCG CAAAGTTCGG CGGGAGATGC TTATTACATG CCATCGCTTT CGTGGATGCA CAAAGGGTCA GCGGTAACGT GGGGTGTTGC AATGTTGGCG CAGGGCGGTA TGGGTACGGA ATATGGTAAG GGTTCGCCAC TCTTTAGCAT GGGAAAGCCA CTTTCAGGGG TGGGTATGGT GCCAATGAGT GGTGAGGAGA TTCGCACGGA GGTTGGTGTT GGTCGTGTTA TGTTCCCCAT TGCATGGAAT GTGTCGGAAA ATACCACTAT TGGTGCCTCG TTTGATGTGG TGTGGGCTGG TATGGATTTA ATGATGGATA TGGATGGTGC TCATTTTGCG AGCATGATGG GTAGTGGAAA TGTAAATGGC ACTATGGCGA CCACGCTTGG AACCATGATG GCACCGGGTG GAGGGGTAAC GGATGTCAAT TATGTGCGCT ATAATTTTTC CAACAATAAT GCTTTTTGGG GTGAAGCAAG TGGCTACGGC ACAGGGTTTA AGCTGGGCAT TACGCATCGG TTAAGTAAAG TGGTAACGGT TGGCGGCAGC TTTCAATCAA AAACAGCAAT GAGCGATTTA AAAGCTACCA AAGCTGAACT TTCATTTGCG GGGGTTGATG GCACTGGCTC TACCTTTACG CAAAAAGTAA ACGGTACCAT TAAGGTGCGC AATTTTGAAT GGCCTACCAC CATTGCTGCT GGTGTAGCAC TTTATCCATC CGATCGCTGG ATGGTTGCGG CGGATGTTAA GCATCTTGGT TGGGCATCGG TCATGCGCTC CTTTTCAACC TCCTTTGAAG CTGATAACAC AGCCGCAAAT GGTGGATTTG CTGGGCAGGA GCTTGAGGTT GCGATGAAGC AGGATTGGGA TGACCAAACG GTGTTTGGTT TTGGTGTGCA ATATCGTGCA AGCGATCGCT TGGTGTTGCG TAGTGGCGCA AGCTTTTCAT CAAATCCTGT GCCAAATGCC TACCTTAACC CAATGTTTCC TGCTACAACC GAAAACCATT ACACTGCCGG TTTTGGTTAC CGCTTAAGCG ATGCAGCCAC TATTGCGCTT GCTGGTGCAT GGGTGCCAAA AGTTAGCGCT CGCAATGGTG ATGGGGTAGA GGTTCATCAT AGCCAAACAA ATTGGTCGTT GAACTATACG CAAGCTTTGT AA
|
Protein sequence | MKKTTSRFAM ACCALLCAPY SAFATNGMNL EGYGAISHAL GGTGSAYNTG NSGVSNNPAT LALRKSKSTQ LGFGLRGLHP DVSLQANGIS QSSAGDAYYM PSLSWMHKGS AVTWGVAMLA QGGMGTEYGK GSPLFSMGKP LSGVGMVPMS GEEIRTEVGV GRVMFPIAWN VSENTTIGAS FDVVWAGMDL MMDMDGAHFA SMMGSGNVNG TMATTLGTMM APGGGVTDVN YVRYNFSNNN AFWGEASGYG TGFKLGITHR LSKVVTVGGS FQSKTAMSDL KATKAELSFA GVDGTGSTFT QKVNGTIKVR NFEWPTTIAA GVALYPSDRW MVAADVKHLG WASVMRSFST SFEADNTAAN GGFAGQELEV AMKQDWDDQT VFGFGVQYRA SDRLVLRSGA SFSSNPVPNA YLNPMFPATT ENHYTAGFGY RLSDAATIAL AGAWVPKVSA RNGDGVEVHH SQTNWSLNYT QAL
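The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in its permitted start codons). The `translate` helper and the excerpt variables are hypothetical names for illustration; only the first 60 and last 12 nucleotides of the record are used.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt and last 12 nt of the gene sequence, copied from the record.
# The tail is in frame because the gene length (1422 bp) is a multiple of 3.
gene_head = "ATGAAAAAAA CAACCTCACG TTTTGCAATG GCTTGTTGTG CCTTGCTTTG TGCTCCTTAT"
gene_tail = "CAAGCTTTGT AA"

print(translate(gene_head))  # MKKTTSRFAMACCALLCAPY
print(translate(gene_tail))  # QAL
```

The two outputs match the first two 10-residue blocks and the final three residues of the listed protein sequence.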