Gene Caul_2605 details


Gene Information

Locus tag: Caul_2605
Symbol:
ID: 5900060
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2825708
End bp: 2826838
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 71%
IMG OID: 641563096
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_001684230
Protein GI: 167646567
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID:
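
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both assumptions agree with the values in this record but are not stated by it. The function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # the last codon is the stop
    return gene_length, protein_length


# Values taken from the record above.
print(expected_lengths(2825708, 2826838))  # (1131, 376)
```

The result matches the Gene Length (1131 bp) and Protein Length (376 aa) fields.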


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATC ACGCCCGCAT CGCCCCCGCC AAGGTCGCCG ACGGTCTGAA GATCGCCGCC 
TTCGACCTCA GCCCCGAGCC CGCCCTGGTC GTCGACCGGG AGGGCGCCTT GGTCGCCGTC
AACGAGGCGG CCGAGGCGCT GTTTGGCCAG GGCCTGTCGC TGCTGGCTCG CGGCCGGTTC
CGCGCCGCCC TGCCGCCAGG CTCGGTCCTG GTCTCGATGA TGGACCGCGC GCTGTTCGAA
GGCGCTCTGG TCCGTGAGCA CGGGGTCGAG GTCAATCTGT TTGGCCAGCC GCCGTTCGAA
GCCGACGGCG CCGCCGCGCC GCTGGGCGAC GGCTCGGTGC TGCTGACCCT GCATGTCAAG
GGCGTGCTGG GCGTCGAGCG GGCCTCGGAC GCCGCCGGCC TCCGCTCGGT CGTCGGCCTG
GGCCGCATGC TGGCCCACGA GATCAAGAAC CCGCTGGCCG GCATTCGCGG CGCGGCCCAG
CTTCTGAAGA CCGGGGCCAG CGCCGCCGAC CAGCCCTTGG CCCAGCTCAT CGTCGATGAA
ACCGACCGCA TCCGCCGCCT GGTTGATCGC ATGGAGGCCT TCTCCGACGA AGTCCCGGGA
CCGCGCGAGG CGGTCAACAT CCACCAGGTG CTGGACCGCG TCCGGGCTCT GGTGGTCAAC
GGCGTCGCCG ACGGCCTGGA CCTGCGCGAA CACTACGATC CGTCGCTGCC TGACGTCTGG
GGCGACGAGG ATCACCTGAT TCAGGTGTTC CTGAACCTGG TCAAGAACGC CGCCGAGGCC
GCCCACGCGC GCGGCGACGG GCAGGGGACA CTGTCGATTC ACACCGCCTG GCGTCCAGGC
GTGCGGGTGC GCGGATCCGA TGGCAAGGCC GCCGCCGGAG CGCCGATCGA GATCCGCATC
CAGGACAACG GCCCCGGCGT GCCCGACAGC CTGCGCGACC ACCTGTTTCA GCCGTTCGTC
ACCACCAAGG CCAACGGCAC CGGCCTGGGC CTGGCCCTGG TCACCAAGCT GGTGACCAGC
CATGGCGGCC TGATCGACTT CGAATCCGAG CCCGGCCGCA CCGTGTTCCG CGTGCTGCTG
CCGATGGCGA CCGGAAAGCT CACCCGCTCT ACTGGAGACG CCCAAGCATG A
 
Protein sequence
MSDHARIAPA KVADGLKIAA FDLSPEPALV VDREGALVAV NEAAEALFGQ GLSLLARGRF 
RAALPPGSVL VSMMDRALFE GALVREHGVE VNLFGQPPFE ADGAAAPLGD GSVLLTLHVK
GVLGVERASD AAGLRSVVGL GRMLAHEIKN PLAGIRGAAQ LLKTGASAAD QPLAQLIVDE
TDRIRRLVDR MEAFSDEVPG PREAVNIHQV LDRVRALVVN GVADGLDLRE HYDPSLPDVW
GDEDHLIQVF LNLVKNAAEA AHARGDGQGT LSIHTAWRPG VRVRGSDGKA AAGAPIEIRI
QDNGPGVPDS LRDHLFQPFV TTKANGTGLG LALVTKLVTS HGGLIDFESE PGRTVFRVLL
PMATGKLTRS TGDAQA
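
The sequences above are printed in space-separated blocks of 10 residues. A minimal sketch of how such text could be cleaned up and used to recompute a GC percentage like the one in the GC content field is shown below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own GC value was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces and line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, as a small worked example.
print(gc_percent("ATGAGCGATC ACGCCCGCAT"))  # 60.0
```

Passing the full gene sequence text to `gc_percent` gives the value to compare against the GC content field, and `len(clean_sequence(...))` gives the value to compare against the Gene Length field.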