Gene Saro_2275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2275 
Symbol 
ID3916593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2415703 
End bp2416887 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content67% 
IMG OID640445031 
Producthistidine kinase 
Protein accessionYP_497546 
Protein GI87200289 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0585084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGGTT TGTCGCTGCC TCTCCCCAGC ATGTTCGTCG CCCTGTGCGC GGCCGTGGTC 
ATGTACTTCG GCGGCGCCGG CTTCTGGCTC TCGCTCGCGA TCCTGCTCGT CTGGCTGGCG
ACACTGTGGC TCGCCCGACC CGAGCCGACG GTGGAAACCC GCAGCAGGGA TGACGGCAGC
GTCTCGCGCC AGGCAATGAT CGAACTGGTC GAACCCTTCG GCCTGCCGGT CCTGATGCTG
GACGGACAGC GCATCGCAGC CGCCAACGCG GCCGCGCGGG AGGAACTCGG CAGCCATATC
GTCGGCCAGG ACGCGCGCGT GGCGCTGCGC CATCCCGAAG CGGTCCGCCT CCTGGACAAG
CCCGAGGGCC GGGCGCTGGT GCGGGGTCTC ACGGGCGCGC GCAGCATCTG GCAGGTAAGC
CGCGTGCCGA TCGACGAACG CTTCTCGCTG ATCGAGTTCG TCAACCGCAC GGCAGAGGCC
GATATCAGCC GCGCGCATAC CGACTTCGTG GCCAACGCCA GCCACGAACT GCGCACCCCG
CTCGCCTCGA TCATCGGCTA TATCGAGACG CTGGCCGATC CCGACGCCAA AGTCGACGAA
GCAACCGCGG CGCGCTTCCA TGCCACGGTG CTGCGCGAGG CACGGCGTCT GCAAAGCCTG
GTCGAAGATC TCATGTCGCT TTCCCGGATC GAGGCCGAGA AGCACGAGTT GCCGCGCGAT
CGCATCGATC TCGGCCAGCT TGTCGGCAGC ATCGCCAGCG AAACAGCGAT GACCGTGGGC
GACGGGCGCC TCGAAGTCGA GACGTGCCCC GCGCTCGTGG CGGGCGACCG GCAGCAGCTT
GACCAACTCG TGCGCAATCT GATCGACAAC GCGTTCAAGT ATGGAGACAC TGCCGCCCCT
GTCGCGGTCA AGGTGGCGAT TCACGGCAAC GAAGCGGAGC TGTCGGTGAC CGACAGGGGC
GAAGGCATCC ACCCCGACCA CCTGCCCTAT CTCACCCGGC GCTTCTATCG GACCGACCCG
GGACGCAGCC GCGCGGCGGG CGGGACGGGC CTCGGGCTCG CCATCGTGAA GCACATCGTG
GAGCGGCATC GCGGCAAGCT GGACATCGCC AGCCAGCTTG GAATCGGCAC GACGGTTACC
GTCAGATTGC CGATTGCAAA CCTGCCCGCT GTTGCTGCAG CCTGA
 
Protein sequence
MKGLSLPLPS MFVALCAAVV MYFGGAGFWL SLAILLVWLA TLWLARPEPT VETRSRDDGS 
VSRQAMIELV EPFGLPVLML DGQRIAAANA AAREELGSHI VGQDARVALR HPEAVRLLDK
PEGRALVRGL TGARSIWQVS RVPIDERFSL IEFVNRTAEA DISRAHTDFV ANASHELRTP
LASIIGYIET LADPDAKVDE ATAARFHATV LREARRLQSL VEDLMSLSRI EAEKHELPRD
RIDLGQLVGS IASETAMTVG DGRLEVETCP ALVAGDRQQL DQLVRNLIDN AFKYGDTAAP
VAVKVAIHGN EAELSVTDRG EGIHPDHLPY LTRRFYRTDP GRSRAAGGTG LGLAIVKHIV
ERHRGKLDIA SQLGIGTTVT VRLPIANLPA VAAA