Gene Caul_1387 details

Gene Information

Locus tag: Caul_1387
Symbol:
ID: 5898842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1473499
End bp: 1475346
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 67%
IMG OID: 641561874
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_001683015
Protein GI: 167645352
COG category: [N] Cell motility
              [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
        [COG4564] Signal transduction histidine kinase
TIGRFAM ID:
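
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1473499
end_bp = 1475346

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1848 bp) and Protein Length (615 aa).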


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATGG CGAAATTCCG CAATCTATCG ATCATGGCGC GCCTGCGCCT GGTGGTGCTG 
TTCGCCGGAA TCGGTCTGGC CGTGGCGATC GGCGTGGGCC TGCTGAACCT CTCGGCCGCC
ATGCACGAGG ATATCTCGCT CAAGACCCGC AGCCAGGTGG AGACGGCGGT GTCCGTCGCC
CAGCACTATG TCGACGAGGC CAAGGCCGGA CGGATGAGCG AGGCCGACGC CAAGATCGCG
GCGATCGGCG CGCTGAAGGC GATGCGCTAT GGCGGCAAGG AATATTTCTG GATCACCGAT
CTCGACACGC GGATGGTGAT GCACCCCAAC AAGCCGGCGT TGGATGGCGC CGACGTGTCC
AAGGAACTCG ACCCGACGGG CAAGGCGCTG TTTTCCGAGA TGACCAAGGT CGCCACGTCT
CAAGGCGCCG GCTTCGTCGA CTACATGTGG CCCAAGCCCG GTCACGACAA ACCGCAGCCC
AAGATTTCGT ACGTCGCGCT GATGCCCGCC TGGGGGTGGG TGATCGGAAC GGGGGTGTAT
GTCGATGACA TCGACGACGC GATCGGGATG GCCGCCCTGA AACTGGCCGG GATCGGCCTG
GCGCTGCTGC TGGTCGTCGG CCTGGGCGCG ACCCTGCTGG GCGTGACCAT CACCCGTCCG
ATCATGACCC TGACCCAGCG CATGAGCGGG CTGGCCAAGG GCGACAAGGA CAGTCAGGTC
CCCTTCACCG ATCTGGCCAA TGAAACCGGC GAGATGGCCC GCGCGCTGGC GATCTTCCGT
GACGCGGCGC TTGATCGCGA GCGCCTGGAA GTCGAAGCGG AGGCCATGCG TGGCGAAGCC
GCGGCCGATC GTCAACTGCG GGAAGCCGCC GATCGCGCGG CGGCGGAAGT TCAACAGCGG
GTCGTCACCG ATGTCACCGC CGTGGCGGGG CGGCTGGCCG CCGGCGACCT GACCGTGCGG
CTGAGCGATG ACTTCCCAAG CGGCTACGCC GAACTGCGTG AAAATCTGAA CGCCGCGCTT
GTGCAACTGG CGTCCGCCAT GAAGGCCGTG CGCGACAACG CCCACGGCAT CCAGCATGGC
GCCGACGACA TCGCCTCGGC CTCCGACAAT CTGTCGCGCC GGACCGAGCA GCAGGCCGCC
ACCCTCGAAG AGACCACAGC GGCGCTGGGC GAACTGACCA GCACCGTGCG GCGCTCGGCC
GAGGACGCCA GCCAGGCCCG CGCCGCGGTC GCCGTGGCTC AGGAGCAGGC CCAGTACAGC
GGGACCGTCG CCGACCAGGC GGTGGCCGCC ATGGGCGAAA TCGAGGGATC GTCGCAACAG
ATCCAGCAGA TCATCGGCGT GATCGACGAG ATCGCCTTCC AGACCAATCT CCTGGCTCTG
AACGCCGGCG TCGAAGCCGC GCGAGCCGGC GACTCCGGCC GGGGCTTCGC GGTCGTCGCC
CAGGAGGTTC GCGCCCTGGC CCAACGGTCG GCCGAAGCGG CCAAGGAGAT CAAAAGCCTG
ATCGGCGCGT CTTCCCGGCA GATCGGCGAC GGCGTCACCC TGGTGCGGGA CATGGGCGGC
GCCCTGCAGG ACATCGTGGG CAAGGTGAAC GAAATCGACG TGCTGATGCG CGGCATCGCC
GGCCTCGCCG CCGACCAGTC CGAGGGCCTG AGCCAGATCA ATATCGCCAT GCTGCAGATC
GACCAGAACA CGCAGCAGAA CGCGGCCATG GTCGAGGAGG CGACCGCGGC GGTGCACTCG
CTGAAAAACG AGACCAATGA ACTGGCGGAT CTGGTCGGCC GCTTCGAACT TGATGAGGCG
AGCCCCCACG TCGTCGCCGC ATCCGACCGT CGCCGATTGC GCGCCTAG
 
Protein sequence
MSMAKFRNLS IMARLRLVVL FAGIGLAVAI GVGLLNLSAA MHEDISLKTR SQVETAVSVA 
QHYVDEAKAG RMSEADAKIA AIGALKAMRY GGKEYFWITD LDTRMVMHPN KPALDGADVS
KELDPTGKAL FSEMTKVATS QGAGFVDYMW PKPGHDKPQP KISYVALMPA WGWVIGTGVY
VDDIDDAIGM AALKLAGIGL ALLLVVGLGA TLLGVTITRP IMTLTQRMSG LAKGDKDSQV
PFTDLANETG EMARALAIFR DAALDRERLE VEAEAMRGEA AADRQLREAA DRAAAEVQQR
VVTDVTAVAG RLAAGDLTVR LSDDFPSGYA ELRENLNAAL VQLASAMKAV RDNAHGIQHG
ADDIASASDN LSRRTEQQAA TLEETTAALG ELTSTVRRSA EDASQARAAV AVAQEQAQYS
GTVADQAVAA MGEIEGSSQQ IQQIIGVIDE IAFQTNLLAL NAGVEAARAG DSGRGFAVVA
QEVRALAQRS AEAAKEIKSL IGASSRQIGD GVTLVRDMGG ALQDIVGKVN EIDVLMRGIA
GLAADQSEGL SQINIAMLQI DQNTQQNAAM VEEATAAVHS LKNETNELAD LVGRFELDEA
SPHVVAASDR RRLRA
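
The sequences under Sequence are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration input is only the first line of the gene sequence, so its GC value is not expected to equal the figure for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a small demonstration input.
first_line = "ATGAGCATGG CGAAATTCCG CAATCTATCG ATCATGGCGC GCCTGCGCCT GGTGGTGCTG"
seq = clean_sequence(first_line)

print(len(seq), seq[:3], round(gc_percent(seq), 1))
```

To check the listed 67% figure, the same two helpers can be applied to the full gene sequence after pasting all of its lines into one string.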