Gene Saro_1066 details

Gene Information

Locus tag: Saro_1066
Symbol:
ID: 3916362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1108934
End bp: 1110097
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 65%
IMG OID: 640443801
Product: hypothetical protein
Protein accession: YP_496345
Protein GI: 87199088
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCATGCCC CGTCCTTAAC CCTGATGCCC GATCTGTCGC GCCCGGCGGC CGAACTGATC 
ACCAGCATGG AGCCCTGGGA CCAGTGGGCA GGCGCGCAAG CGATAAGCCT TTGGGCGGAG
CTTGCCGATG CGGCGACGAC GCCGAACCCC TTCTTCGAAC ACTGGTATCT CCTTCCCGCA
CTGGAAGCCT TCGATGCCGA CCGTGCGGCA CGCATCCTCG CGATAAGGAC CGGCGGCAGG
CTGATCGGGC TCATGCCGAT CGTCATGCAA CGCAGCTATC AACGCTGGCC CCTGCGACAC
CTTGCCTCGT GGCAGCACGC CAATGCCTTC CTTGGTACAC CGCTGGTGCG CGCCGGATGC
GAGACACTGT TCTGGCAAGG ATTGCTGGAT TGGGCCGACA AGGCAGCGCC CGACGATGGC
GCGCTGTTCC TGCACCTGCC GGCAATGCCC CTCGAGCAGC CGCTGACCCA GTGCCTGATG
GACCTGTGCT TCGAAAATGG CCGTCCCGCG GGACTGGTGA TGCGCGAACA GAGGGCCTTG
CTGCACTCTC CGCTGAGCCC GGAAGCCTAT CTGGAACGTG CGCTGCGAGG CAAGAAGCGC
AAGGAACTTC GCCGCCAACA TGCCAGGCTT GCCGAACAGG GAGCGCTGGC GTTCGAGCGG
CGCGAGGATG CGCAAGGGAT CGACGCGTGG ATCGAGACCT TTCTGGCGCT TGAAGCGGCT
GGCTGGAAAG GGCGGTCGGC AAGCGCCATG GCCTTTGCCC CCGAAACCGC ATCGCTCTTC
CGCCAGGCAC TGATCCAGGC GGCGGCGCTC GGTAAGCTGG AACGGCTTTC GCTGACGCTG
GACGGGCGAC CTGTCGCGAT GCTGGCGAAC TTCATCACGC CGCCGGGCAG CTTCTCCTAC
AAGACCGCAT TCGACGAGAC GCTCGCCCGC TTCTCGCCCG GCGTCCTCCT GCAACTGGAA
AACCTCGCAT TGCTGCGCCG GGACGACGTG ACCTGGTGCG ATAGCTGCGC CGCGCCCGAT
CACCCGATGA TCGACAGCAT CTGGACCGAA CGGCGCCCCA TCGGGCGCCT GTCGGTCGGC
ATCGGTGGCA AGATCCGGCG TGCGATCTTC AAAACGACGC TTGCCCTCGA ACTGCAACGC
AACCCGACTG GAATCGGTGC ATGA
 
Protein sequence
MHAPSLTLMP DLSRPAAELI TSMEPWDQWA GAQAISLWAE LADAATTPNP FFEHWYLLPA 
LEAFDADRAA RILAIRTGGR LIGLMPIVMQ RSYQRWPLRH LASWQHANAF LGTPLVRAGC
ETLFWQGLLD WADKAAPDDG ALFLHLPAMP LEQPLTQCLM DLCFENGRPA GLVMREQRAL
LHSPLSPEAY LERALRGKKR KELRRQHARL AEQGALAFER REDAQGIDAW IETFLALEAA
GWKGRSASAM AFAPETASLF RQALIQAAAL GKLERLSLTL DGRPVAMLAN FITPPGSFSY
KTAFDETLAR FSPGVLLQLE NLALLRRDDV TWCDSCAAPD HPMIDSIWTE RRPIGRLSVG
IGGKIRRAIF KTTLALELQR NPTGIGA
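
The lengths and the leading methionine in this record can be cross-checked with a short script. The following is a minimal sketch and not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that translation table 11 uses the standard codon assignments while permitting TTG (among others) as a start codon, which would explain why the leading TTG is shown as M in the protein sequence. The names `translate_table11`, `TABLE11_STARTS`, and `first_block` are illustrative only.

```python
from itertools import product

# Compact encoding of the standard codon assignments, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: start codons permitted under translation table 11.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna):
    """Translate a coding sequence, reading a permitted start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in TABLE11_STARTS:
            protein.append("M")  # initiator codon is translated as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP, END_BP = 1108934, 1110097
gene_length = END_BP - START_BP + 1      # bases spanned by the gene
protein_length = gene_length // 3 - 1    # codons minus the stop codon

# First 30 bases of the gene sequence listed above.
first_block = "TTGCATGCCC CGTCCTTAAC CCTGATGCCC"
print(gene_length, protein_length, translate_table11(first_block))
# prints: 1164 387 MHAPSLTLMP
```

The printed values agree with the Gene Length and Protein Length fields and with the first ten residues of the protein sequence.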