Gene Saro_1076 details

Gene Information

Locus tag: Saro_1076
Symbol:
ID: 3916372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1119798
End bp: 1120772
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 63%
IMG OID: 640443811
Product: type II secretion system protein
Protein accession: YP_496355
Protein GI: 87199098
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4965] Flp pilus assembly protein TadB
TIGRFAM ID:
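
The coordinates and lengths listed above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names `span_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 1119798
end_bp = 1120772
gene_length_bp = 975
protein_length_aa = 324


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 1120772 - 1119798 + 1 = 975 bp; 975 / 3 - 1 = 324 aa
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```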


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.195162
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAGCA TTATCCAGCT TCTGCTGATG TCCACCGGGC TGATGGTGGC GATGGTGCTG 
GGCTATGCCG CGCTGTCCGG TCCATCGGCC TCGAAGGAAG TCAACCGGCG CCTCCAGTCG
GTCCGCTTCC GTCACTCGGA AAGTACGCTG GACAAGGTCG AAGCGCAGTA TCGCAAGACG
CTCGCCGCGC GCAAGCCGAA GACCATGAGG CCCGCCGGGT CTTCGTCGCG GCTCGAGGCG
CTCGAACTGC GCCTGCACCG CACCGGCAAG GGCTGGACGC TTTCGCAGTA CCTCTACGTC
AGCGGCGGCC TCGCCATTCT GATCTTCCTG CTGGTGTACC TGCGCACAGG CGCGCCTCTG
CTGGCGCTCG GTTCCGGCAT TTTCATCGGC GGCGGCCTTC CCCACATGCT GGTCGGTCGT
GCGATCAACA AGCGGATCGA CAACTTCGTC ACCCGCCTGC CCGATGCCCT GGACCTGCTG
GTACGCGGCC TGCGCTCGGG CCTGCCCGTC ACCGAAACGC TCGGCGTCGT CGCGGCCGAA
CTGCCGGGCC CGGTGGGCGA GGAGTTCAAG CTGGTGACCG ACCGCATCAA GGTGGGCCGC
ACGATGGAAG AGGCTCTCCA GGACACGGCG GACCGGTTGA ACCTGCCGGA ATTCAACTTC
TTCTGCATCA CGCTGGCGAT CCAGCGCGAG ACGGGCGGCA ACCTCGCGGA AACGCTGTCG
AACCTGTCCG ACGTGCTGCG CAAGCGCGCA CAGATGAAGT TGAAGATCAA GGCGATGAGT
TCGGAATCGA AAGCCTCGGC GTATATCGTC GGCGCCCTGC CCTTCATCGT CTTCGCCCTG
ATCTACTGGA TCAACCCGGT CTATCTCGGA AAGTTCTTTG TGGACGAACG CCTCATCATC
GCCGGCCTTG GCGGCCTGAC CTGGCTCGGT ATCGGAGCCT TCATCATGGC CAAGATGGTC
AGCTTCGAAA TCTGA
 
Protein sequence
MTSIIQLLLM STGLMVAMVL GYAALSGPSA SKEVNRRLQS VRFRHSESTL DKVEAQYRKT 
LAARKPKTMR PAGSSSRLEA LELRLHRTGK GWTLSQYLYV SGGLAILIFL LVYLRTGAPL
LALGSGIFIG GGLPHMLVGR AINKRIDNFV TRLPDALDLL VRGLRSGLPV TETLGVVAAE
LPGPVGEEFK LVTDRIKVGR TMEEALQDTA DRLNLPEFNF FCITLAIQRE TGGNLAETLS
NLSDVLRKRA QMKLKIKAMS SESKASAYIV GALPFIVFAL IYWINPVYLG KFFVDERLII
AGLGGLTWLG IGAFIMAKMV SFEI
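
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before any computation. The following Python sketch shows one way to clean a block-formatted sequence, compute its GC fraction, and translate it. It is an illustration under stated assumptions, not the method the database itself uses: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose sense-codon assignments translation table 11 shares. Alternative start codons permitted by table 11 are not handled here.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same assignments
# for internal codons. '*' marks a stop codon.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, exactly as printed in the record.
first_row = "ATGACCAGCA TTATCCAGCT TCTGCTGATG TCCACCGGGC TGATGGTGGC GATGGTGCTG"
print(translate(clean_sequence(first_row)))  # MTSIIQLLLMSTGLMVAMVL
```

The printed peptide matches the first twenty residues of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.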