Gene Caul_3214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3214 
Symbol 
ID5900669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3475594 
End bp3477192 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content71% 
IMG OID641563719 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001684839 
Protein GI167647176 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.409706 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGT TCGACAGTAT CGAGGCCCAG CGCCGCATCG GCGGCCATCT GATCATGGCC 
TGTATCGCCG CGATGATCCT GGTGGTTCCG GCCGCGCGCC TGTTCGCCGG CGGCCCCGTC
GTCGGCCTGG CCCTGGGTTC GGTGGGGTTC GCGGCCATGG CCTGGGGCGG GTACGCCATC
TGGCGCGACA GCGTCGCCCA GCGGATCTCG ACCGCCATCA GCCTGGTCGG CCAAGTCACC
CTGTTCGTGG CCGCCTTCTC GGGCCATGCC TGGCAGGCCG ACGCCCGCAT GGCCTATCTG
GCCGCCTTGG CCCTGCTGGT CGCCTATGGC GATTGGCGCG TGGTGGCGAC CGGCGCTGTC
AGCGTCGTGG CGGTTGAGAT CGGCGCCTCC GTCCTGGCCC CCCATTTGCT GATTCCAGGC
GAGGTCTCAC CGCTGCGGAT CGCTTTCAAC GCCGGCGTGA CCCTGGTCAC CGCATGGTCC
CTGATCTGGC TGACGGCCGG CGTGTCGCGG CTGTTCGTCA CCGTCACCGC GCGGACCGAC
AAGGCGCTGG ACGCGGCGCG GGAAGCCGAC GCCGCCAATG TCGCCGCCGA GGCGGCCCGC
GCCGCCCGCG ACGCCGACAA CGCCGAACAG GCCGCCCAGA AGGCCGCCCT TGAGGCCGAA
CAGACTCTGG TGGTCGACAC CGTGGCCGAG GGCCTGGCCC ACCTGTCGCG CGGCGACCTG
ACCTGCCGCC TGACCCAGCC CTTCGCCGCG CGCTACGAGC CGCTGCGCAT CGACTTCAAC
GGCGCGATGG AGAAACTGCA GGCGGCGATG CGGGAGATCA CCGGCAACGC CTCCAGCATG
ACCGCCGGCG TGGCCGAGAT GGCCCGCGCC ACCGACGAGC TGGCCGACCG CACCGAACAG
CAGGCCGCCA GCTTGGTGGA GACCGTGGCG GCCCTCGACC AGATCACCGC CGCCGTCCGC
TCGACCGCCG ACGGCGCCCA CCAGGCCAAC GCCGCCGCCG CCAGCGCCCG CTCCGAGGTC
GAACGCTCCG ACCCCGTGGT CACCGAGGCT GTCGAGGCCA TGACCCTGAT CGAAGCCTCT
TCCGGCAAGA TCGGCCATAT CATCGGGGTG ATCGACGAGA TCGCCTTCCA GACCAATCTT
CTGGCCTTGA ACGCCGGGGT CGAAGCGGCC CGGGCCGGCG AGGCGGGCCG CGGCTTCGCC
GTCGTCGCCC AGGAAGTGCG AGCCCTGGCC CAGCGCTCGG CCGACGCGGC CAAGGAGATC
AAGGGCCTGA TCAACGAGTC GGGCGCCCAG GTCGCGGCCG GTGTCGAACG CGTGGGCCGC
ACGCGCGAGG CTTTGCAACG GATCGTCAGC GTGGTGGCCC AGATCGATCA ACAGGTCACC
GCCATCGCCC GCTCGGCCCA GGACCAGGCC CTGGGCCTGG GCGAGGTCAA CACCGCCATG
GCCGAGATGG ATCGGGTCGT GCAGCGCAAC GCCGCCATGG TCGAGGAAAC CACCGCCGCC
GCTCACGCGC TGCAGGGCGA AAGCCGCGAA CTTGGGCAAC GGATCGATCT GTTCGATATC
GGCCAGGCGC AAGCGGCGGG TGACCGCCGG GCGGCTTAG
 
Protein sequence
MIEFDSIEAQ RRIGGHLIMA CIAAMILVVP AARLFAGGPV VGLALGSVGF AAMAWGGYAI 
WRDSVAQRIS TAISLVGQVT LFVAAFSGHA WQADARMAYL AALALLVAYG DWRVVATGAV
SVVAVEIGAS VLAPHLLIPG EVSPLRIAFN AGVTLVTAWS LIWLTAGVSR LFVTVTARTD
KALDAAREAD AANVAAEAAR AARDADNAEQ AAQKAALEAE QTLVVDTVAE GLAHLSRGDL
TCRLTQPFAA RYEPLRIDFN GAMEKLQAAM REITGNASSM TAGVAEMARA TDELADRTEQ
QAASLVETVA ALDQITAAVR STADGAHQAN AAAASARSEV ERSDPVVTEA VEAMTLIEAS
SGKIGHIIGV IDEIAFQTNL LALNAGVEAA RAGEAGRGFA VVAQEVRALA QRSADAAKEI
KGLINESGAQ VAAGVERVGR TREALQRIVS VVAQIDQQVT AIARSAQDQA LGLGEVNTAM
AEMDRVVQRN AAMVEETTAA AHALQGESRE LGQRIDLFDI GQAQAAGDRR AA