Gene Caul_3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3039 
Symbol 
ID5900494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3301991 
End bp3303775 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content69% 
IMG OID641563541 
ProductTrkA domain-containing protein 
Protein accessionYP_001684664 
Protein GI167647001 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.170851 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGCTGC AACAAGGACT GTCTTTCGGG CTGATCGGCG CGACGGTGCT GTGTTTCATC 
TGGGGGCGTT GGCGCTATGA CCTGATCGCC TTGGGCGCCC TGGCTGTCGG GGTGGTTTCG
GGTCTGATCC CGATCAAGTC GGCCTTCGAC GGCTTTTCCA ACGACATCGT CGTGATCATC
GCCAGCGCCC TGGTGCTGAG CGCCGCGGTC GCGCGGTCGG GTCTGGTCGA TACGCTGATG
GCCCCCCTGC TGCCGCATCT GAAGGATGAG CGTACGCAGG TTCCGGTGTT GTCGACCGTC
ACCACCGTGC TGTCGATGGT CACCAAGAAC GTTGGGGCCC TGGCGTTGAT GATGCCCTCG
GCCCTGCGCA TGGCGCGCAA CACCGGCGTC TCGCCCGGCC GGTTGTTGAT GCCGATGTCG
TTCGGCTCAC TGGTCGGCGG GTTGGCGGTT CTGGTGGGCA CCTCGCCCAA CATCATTGTC
TCGGAGGTGC GTCAGCAAGC GCTGGGCAAG CCGTTCGCGA TGTTCGACTT CATGCCGGTG
GGCGGAACGC TGGCGGTGCT CGCGCTGCTA TACCTGGCCT TCGCCTATCG CCTGCTGCCC
AAGGAGCGGA CGGCCGCCAT CGACATCGAC GCGGCCCTGG CCGCCAACGC CTATGTCACC
GAGGTCGAAA TCCCCGAAGG CTGGTCCTTC GAGAGCTCGC GGGTCGCCGA CCTGCAGAAG
GCGGCGGGCG AGGCGGTGGT CGTGGTGGCC CTGCTGCGCG GCCGCAAGCG CATCGCCTCG
CCGCATCCCA ACCGCAAGAT CCTGCCGGGC GATGTGCTGC TGCTCGAGGG CCAGCAGCAG
GAACTGAACG AGCTGATCGT CAAGGCCAAG CTCAAGCTCA GCGACGCCCA TCGCCCCGTG
GTGATGGAGG AGCCGACCGA CGAGGTGCGG GTGGTCGAGG CGGTGATCGG CGCGGAGTCC
GATCTGATCG GCCAAACGGC CAGGGGAGTG ACGCTCAACG AGACCTATGG CGTCAACCTG
CTGGGCGTCA GCCGGGCCGG CTATCGCATG GCCGGCCGCC TGGCGACGGT GCGGATGAAG
GCCGGCGATA TCCTGGTGCT GCAGGGCGCC GAGCAACTGC TGCCCAACGC GCTTCAGGCC
CTGGGCTGCC TGCCGTTGGC CGAGCGCGAA GTGCGACTGG GCGGGTTTCG CCACGCCTTG
CTGCCGACCG GCATCCTCGC CGTGGCCATG TTGCTTGTCG CGTTCGGAGT CCTGCCCGTG
GCCGGCGGCT TCTTCGCCGC CGCCGTGCTG GTGGTGGCGA CCGGCGCGCT GCGCATGCGC
GAGGCCTATT CAGCGCTCGA TGCGCCGGTG CTGGTGCTCG TGGCCGCCCT GATCCCCGTC
AGCGACACGA TCCAGGCCAG CGGCGGGACC GACCTGATCG CCGGCTGGCT GTCGGGCGCG
TTCCACGGCT TGCCGCCGCT GCTGACCCTG ACGGCGATGA TGGCGGTGGC CATGGCCGCC
ACGCCGTTCC TCAACAACGC CGCCACGGTG CTGATCGTCG CCCCGATCGG CCTGGGCCTG
GCCAAGCACC TGGGCCTGAG CCCGGATCCC TTCCTGATGG CCGTGGCGGT GGGGGCAGGG
TGCGACTTTC TCACGCCCGT CGGCCACCAG TGCAACACGC TGGTGCTGGG GCCGGGCGGC
TACAGGTTTG GCGACTATGC GCGACTGGGC GCGCCGCTGA CCGTCCTGAT CCTGGTCGTG
GCGCCCAGCT TGATCGCCCT GGTCTGGCCG TTCGCGGGAC GGTGA
 
Protein sequence
MTLQQGLSFG LIGATVLCFI WGRWRYDLIA LGALAVGVVS GLIPIKSAFD GFSNDIVVII 
ASALVLSAAV ARSGLVDTLM APLLPHLKDE RTQVPVLSTV TTVLSMVTKN VGALALMMPS
ALRMARNTGV SPGRLLMPMS FGSLVGGLAV LVGTSPNIIV SEVRQQALGK PFAMFDFMPV
GGTLAVLALL YLAFAYRLLP KERTAAIDID AALAANAYVT EVEIPEGWSF ESSRVADLQK
AAGEAVVVVA LLRGRKRIAS PHPNRKILPG DVLLLEGQQQ ELNELIVKAK LKLSDAHRPV
VMEEPTDEVR VVEAVIGAES DLIGQTARGV TLNETYGVNL LGVSRAGYRM AGRLATVRMK
AGDILVLQGA EQLLPNALQA LGCLPLAERE VRLGGFRHAL LPTGILAVAM LLVAFGVLPV
AGGFFAAAVL VVATGALRMR EAYSALDAPV LVLVAALIPV SDTIQASGGT DLIAGWLSGA
FHGLPPLLTL TAMMAVAMAA TPFLNNAATV LIVAPIGLGL AKHLGLSPDP FLMAVAVGAG
CDFLTPVGHQ CNTLVLGPGG YRFGDYARLG APLTVLILVV APSLIALVWP FAGR