Gene Saro_2644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2644 
Symbol 
ID3918418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2880616 
End bp2882082 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content66% 
IMG OID640445421 
Productbacteriophage N4 adsorption protein B 
Protein accessionYP_497914 
Protein GI87200657 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGGACAT GGCTTGCCGA TAGTGCGTAT CAGTGGCTCG CCGTTCTCGA GCATGAGCTT 
CTGTTGTTCG CGGCGGTGTG GTTCGCGATC GGGGCGGCTG ACGAATTGGT CATGGACGGG
ATCTGGCTCT GGCAGCGCCT GACCGGCGCG GGCCCGACCG GGCAACTGGC CGGCAACGGT
CGCGACAAGC TTTCCTCCAT GGCAGCGGTT TTCGTGCCAG CCTGGCGGGA GTCCGCCGTG
ATCGGGCCCA TGGTCGCGCA TTGCCTTGCG GTCTGGCCGC AGGAAGATTT GCGGATCTAC
GTGGGATGCT ACCGCAACGA CCAGGAAACG CTGAATGCGC TGACGATCGT CAGCGAGGAC
CCGAGGGTCC GGGTGGTCGT CCATGACCGT GACGGACCGA CCACCAAGGC AGACTGCCTC
AACCGTCTCT ACCTCGCCAT GCGCCAGGAC GAACGGCGCA GCGGACAGCG GATCGGCTTC
ATTGTGCTGC ATGACGCGGA GGACATGGTC CATCCCGCAG CCCTCGCACT GATGGATCGG
GCGCTTGATA CGGTCGATTT CGTACAACTG CCGGTGCGTC CCGAGCCGCA GGCATCCTCG
CCGTGGGTCG CCGGACACTA CTGCGACGAG TTCGCCGAGG CGCACGCGAG AGACATGGTG
GTGCGCGATC ATATCGGGGC CGGGTTGCCT TCGGCCGGCG TCGGATGTGC GTTCTCGCGC
GCCGCGATCG AGCGCATCGT GGCGGTGCGC GGAGGCGCCT TGCCGTTTGC GGCGGACTGC
CTGACGGAAG ACTACGAGGC GGGCATGCTG GTCGCCGAGA CAGGTGGCCG TTCGCGCTTC
ATCAGGGTGC GCGATGCGCG CGGGGAGCTT GTCGCGACCC GGGAGTTCTT TCCCGATGGC
CTTGCAGCAT CGGTGCGGCA GAAGACGCGC TGGGTGCACG GGATCGCATT CCAGGGTTGG
GATCGGCTGG GATGGAACCG GTCCGCCGGA GACCTGTGGA TGCGCCTGAG GGACAGGCGG
GGGCCGCTCG TGGCGCTGGT TCTGCTTGCG GCATACCTGG CCTTGCCGCT GTGGCCCATC
GTGAGGTTCG GCGAGATGGC GGGCTTCGTC GTGCCGGTGC CACCCGGCCC AGTGCTGAAG
GGTCTGCTTG CCTTCAACCT TTGCAGCCTG ATCTGGCGGC TAGTCGTCCG GGCGCTGTTC
ACCGGCAGCG AGTACGGGTG GATCGAAGGT GTACGGTCGG TTTTCCGGTT TCCCGTGGGC
AACATCATCG CGATAATGGC CGCGCGCCGC GCTGCGGTGG CTTATGTCCG GGTGCTTTTC
GGGGGAGCGC TTACCTGGGA CCACACACTA CACTGCGCAC ACCCGGTGCA GGCCGGGGTC
GGCCTTGCCA GCCACGCTAG TTCGACCCAA CGCCCACGGC GGCCTGCAAG CAGCGGTCTT
GTCCCGGCGG CGCTGGTCGG CAACTGA
 
Protein sequence
MRTWLADSAY QWLAVLEHEL LLFAAVWFAI GAADELVMDG IWLWQRLTGA GPTGQLAGNG 
RDKLSSMAAV FVPAWRESAV IGPMVAHCLA VWPQEDLRIY VGCYRNDQET LNALTIVSED
PRVRVVVHDR DGPTTKADCL NRLYLAMRQD ERRSGQRIGF IVLHDAEDMV HPAALALMDR
ALDTVDFVQL PVRPEPQASS PWVAGHYCDE FAEAHARDMV VRDHIGAGLP SAGVGCAFSR
AAIERIVAVR GGALPFAADC LTEDYEAGML VAETGGRSRF IRVRDARGEL VATREFFPDG
LAASVRQKTR WVHGIAFQGW DRLGWNRSAG DLWMRLRDRR GPLVALVLLA AYLALPLWPI
VRFGEMAGFV VPVPPGPVLK GLLAFNLCSL IWRLVVRALF TGSEYGWIEG VRSVFRFPVG
NIIAIMAARR AAVAYVRVLF GGALTWDHTL HCAHPVQAGV GLASHASSTQ RPRRPASSGL
VPAALVGN