Gene Saro_0727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0727 
Symbol 
ID3918551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp768182 
End bp769204 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content67% 
IMG OID640443459 
ProductKpsF/GutQ family protein 
Protein accessionYP_496008 
Protein GI87198751 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.516394 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAA GGTTCGACAG CGATCCCTTC GAGCCGGTCG CGACAGAGGC GGTCACCCGA 
TCGATTCCGC CCCGCAAGAT CGGTCGTGAG GTCGTGATCG CGGAAAGCGC AGGGCTTGCT
GCACTTGCCG AGGCGCTGGA TGCCTCGTTC GACGCTGCGG TGAGTCTACT GCATTCCGGC
GGCGGTCGCG TGTTCGTGAG CGGGGTGGGC AAATCGGGCC ACGTCGCCCG AAAGATAGCC
AGCACCTTAT CGTCCACCGG TCGTCCGGCG TGCTTCATTC ATCCGGTGGA GGCCATGCAT
GGCGATCTTG GAATGCTGTG CCCCGGGGAC GTGCTGATCG TGCTGTCCAA TTCGGGAGCG
TCGATGGAAC TGCGCGGCCT AGTCGACCAT GCGCAGCGTC TTTCGGCAAG GATCGTGGCG
ATCGGGGCCC GGCCGGACTC TCCGCTGATG CGGGTGGCGG ACATCGCGCT CGTCATTCCC
GATGGCCCCG AAGCATGCCC GGTCAACATT GCGCCAACCA CATCGACCAC GATGATGCTG
GCACTGGGCG ATGCGCTTGC CGTGGCTGTG ATGAGCGCGC GCGGCATCGG GGTAGAGCGC
ATCAGGCTGC TCCACCCGGG AGGCCCGATC GGAGAGCGGC TCCGCGTCGC GGAGGACGTG
ATGCGAACCG ACGCCCTGCC GCTGGTGGGG GTCGAGGACC CGATGCCCGA AGTGCTGTTG
TGCATGGCGC GATCAGGCCT CGGAATCGCG GGCGTGGTCG CCTTGGGTGG AGGACTGGTC
GGCGTGATCG AGGCGGACAG GCTGCCAGCC GTGGCCCGCG ACCTCGCCGG GGAGCGGGCC
GGGTTTTTGA TGAACCGCCA CGCCTGGGTC GCAAGGCGGG AAACGCCCTT GGACGAAATC
GCCCGGAACC TGGGTGTCGG CGGGAGCGAT GCGGCCCTCG TGATTGCGGG CGAGAACGAT
CGCAGGCCGA TCGGCGTCGT CAGCGCTCGG AACCTCGGCA CGTCGGGAGC GTGGCCGGCA
TGA
 
Protein sequence
MKTRFDSDPF EPVATEAVTR SIPPRKIGRE VVIAESAGLA ALAEALDASF DAAVSLLHSG 
GGRVFVSGVG KSGHVARKIA STLSSTGRPA CFIHPVEAMH GDLGMLCPGD VLIVLSNSGA
SMELRGLVDH AQRLSARIVA IGARPDSPLM RVADIALVIP DGPEACPVNI APTTSTTMML
ALGDALAVAV MSARGIGVER IRLLHPGGPI GERLRVAEDV MRTDALPLVG VEDPMPEVLL
CMARSGLGIA GVVALGGGLV GVIEADRLPA VARDLAGERA GFLMNRHAWV ARRETPLDEI
ARNLGVGGSD AALVIAGEND RRPIGVVSAR NLGTSGAWPA