Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3039 |
Symbol | |
ID | 5900494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3301991 |
End bp | 3303775 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641563541 |
Product | TrkA domain-containing protein |
Protein accession | YP_001684664 |
Protein GI | 167647001 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.170851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCTGC AACAAGGACT GTCTTTCGGG CTGATCGGCG CGACGGTGCT GTGTTTCATC TGGGGGCGTT GGCGCTATGA CCTGATCGCC TTGGGCGCCC TGGCTGTCGG GGTGGTTTCG GGTCTGATCC CGATCAAGTC GGCCTTCGAC GGCTTTTCCA ACGACATCGT CGTGATCATC GCCAGCGCCC TGGTGCTGAG CGCCGCGGTC GCGCGGTCGG GTCTGGTCGA TACGCTGATG GCCCCCCTGC TGCCGCATCT GAAGGATGAG CGTACGCAGG TTCCGGTGTT GTCGACCGTC ACCACCGTGC TGTCGATGGT CACCAAGAAC GTTGGGGCCC TGGCGTTGAT GATGCCCTCG GCCCTGCGCA TGGCGCGCAA CACCGGCGTC TCGCCCGGCC GGTTGTTGAT GCCGATGTCG TTCGGCTCAC TGGTCGGCGG GTTGGCGGTT CTGGTGGGCA CCTCGCCCAA CATCATTGTC TCGGAGGTGC GTCAGCAAGC GCTGGGCAAG CCGTTCGCGA TGTTCGACTT CATGCCGGTG GGCGGAACGC TGGCGGTGCT CGCGCTGCTA TACCTGGCCT TCGCCTATCG CCTGCTGCCC AAGGAGCGGA CGGCCGCCAT CGACATCGAC GCGGCCCTGG CCGCCAACGC CTATGTCACC GAGGTCGAAA TCCCCGAAGG CTGGTCCTTC GAGAGCTCGC GGGTCGCCGA CCTGCAGAAG GCGGCGGGCG AGGCGGTGGT CGTGGTGGCC CTGCTGCGCG GCCGCAAGCG CATCGCCTCG CCGCATCCCA ACCGCAAGAT CCTGCCGGGC GATGTGCTGC TGCTCGAGGG CCAGCAGCAG GAACTGAACG AGCTGATCGT CAAGGCCAAG CTCAAGCTCA GCGACGCCCA TCGCCCCGTG GTGATGGAGG AGCCGACCGA CGAGGTGCGG GTGGTCGAGG CGGTGATCGG CGCGGAGTCC GATCTGATCG GCCAAACGGC CAGGGGAGTG ACGCTCAACG AGACCTATGG CGTCAACCTG CTGGGCGTCA GCCGGGCCGG CTATCGCATG GCCGGCCGCC TGGCGACGGT GCGGATGAAG GCCGGCGATA TCCTGGTGCT GCAGGGCGCC GAGCAACTGC TGCCCAACGC GCTTCAGGCC CTGGGCTGCC TGCCGTTGGC CGAGCGCGAA GTGCGACTGG GCGGGTTTCG CCACGCCTTG CTGCCGACCG GCATCCTCGC CGTGGCCATG TTGCTTGTCG CGTTCGGAGT CCTGCCCGTG GCCGGCGGCT TCTTCGCCGC CGCCGTGCTG GTGGTGGCGA CCGGCGCGCT GCGCATGCGC GAGGCCTATT CAGCGCTCGA TGCGCCGGTG CTGGTGCTCG TGGCCGCCCT GATCCCCGTC AGCGACACGA TCCAGGCCAG CGGCGGGACC GACCTGATCG CCGGCTGGCT GTCGGGCGCG TTCCACGGCT TGCCGCCGCT GCTGACCCTG ACGGCGATGA TGGCGGTGGC CATGGCCGCC ACGCCGTTCC TCAACAACGC CGCCACGGTG CTGATCGTCG CCCCGATCGG CCTGGGCCTG GCCAAGCACC TGGGCCTGAG CCCGGATCCC TTCCTGATGG CCGTGGCGGT GGGGGCAGGG TGCGACTTTC TCACGCCCGT CGGCCACCAG TGCAACACGC TGGTGCTGGG GCCGGGCGGC TACAGGTTTG GCGACTATGC GCGACTGGGC GCGCCGCTGA CCGTCCTGAT CCTGGTCGTG GCGCCCAGCT TGATCGCCCT GGTCTGGCCG TTCGCGGGAC GGTGA
|
Protein sequence | MTLQQGLSFG LIGATVLCFI WGRWRYDLIA LGALAVGVVS GLIPIKSAFD GFSNDIVVII ASALVLSAAV ARSGLVDTLM APLLPHLKDE RTQVPVLSTV TTVLSMVTKN VGALALMMPS ALRMARNTGV SPGRLLMPMS FGSLVGGLAV LVGTSPNIIV SEVRQQALGK PFAMFDFMPV GGTLAVLALL YLAFAYRLLP KERTAAIDID AALAANAYVT EVEIPEGWSF ESSRVADLQK AAGEAVVVVA LLRGRKRIAS PHPNRKILPG DVLLLEGQQQ ELNELIVKAK LKLSDAHRPV VMEEPTDEVR VVEAVIGAES DLIGQTARGV TLNETYGVNL LGVSRAGYRM AGRLATVRMK AGDILVLQGA EQLLPNALQA LGCLPLAERE VRLGGFRHAL LPTGILAVAM LLVAFGVLPV AGGFFAAAVL VVATGALRMR EAYSALDAPV LVLVAALIPV SDTIQASGGT DLIAGWLSGA FHGLPPLLTL TAMMAVAMAA TPFLNNAATV LIVAPIGLGL AKHLGLSPDP FLMAVAVGAG CDFLTPVGHQ CNTLVLGPGG YRFGDYARLG APLTVLILVV APSLIALVWP FAGR
|
| |