Gene Saro_1523 details

Gene Information

Locus tag: Saro_1523
Symbol:
ID: 3917198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1567809
End bp: 1568795
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 59%
IMG OID: 640444264
Product: type II secretion system protein
Protein accession: YP_496798
Protein GI: 87199541
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4965] Flp pilus assembly protein TadB
TIGRFAM ID:
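
The start and end coordinates, gene length, and protein length listed above can be cross-checked against one another. The following is a minimal Python sketch of that check. The function names are illustrative and not part of any database API, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates as listed for Saro_1523 on NC_007794.
length_bp = gene_length(1567809, 1568795)
print(length_bp, protein_length(length_bp))  # prints: 987 328
```

Under these assumptions the computed values agree with the listed gene length of 987 bp and protein length of 328 aa.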


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCTCG AACTTCTCAG GTTCTTTGCC CTGTTGGCTA TCTTCGTATT CGTCGTGATC 
GTCGTGCAGC AGGCCGCCAA TGCAACGCTC ACGCGGCAGC GCAATCGCAA CGCGGTCAAC
AAGCGGCTGA AGCTGCTCGC GTCGGGGATG GATCGGGAAG CGGTCACAAA TCTTCTCAAG
GTCAATCACG CAGGGTCTTT CAGGCGCCGC GGGATTGTCG TCCGGCTTCG CAGGATGCTT
GCCAGAACCG GCATGAGCAT AACGCCGGAC AGGATATTGG CGGGAATGGC CATCGCGACT
GCGGTGGCCA TAATACTGCT GAGCCTTCTG GCGTTCAGTC TCGGCACCAC GATAACCTTG
GGTACGATTC TGCTGATTTG TGTCGTCGGC CTTGGTTTCG GGGCCGGCAT TCCCTACGTC
GTTTTGAACC GTAGCGCGGA AAAGCGAAGG AAGCGGATGG AGCAGCAGTT CGCTCCCGCC
GTGGACATCT TCACGCGCGC CTTGCGAGCA GGGCATCCTG TAGCTTCGGC GATCAGCCTC
CTGTCCACTG AGATGGCGGA TCCAATCGGC ACAGAGTTCG GCCTCGTCGC GGACGAGATC
GCCTATGGCG CGAACCTCAA TGACTCCCTG GTAGCACTTG CGGAACGCTG GGAACTGGAA
GATCTGCACA TGTTCGCAGT CTGCCTGTCA GTGCAGAGCG AAACGGGCGG CAATCTGGCT
GAGATCCTTG GAAACCTTGC GTCGGTCATT CGCGACCGCG CGAGCCTTTA CCAGAAGGTG
CGCGCGTTGA GCTCCGAAGG GCGGGCAAGC GCCTGGATGC TTTCGGTCCT GCCCGCACTG
ACGATGCTTG TCCTCTTCAT GATAAACCCG AGCTTCTACC TTGAGGTGTC GGGCGACCCG
CTGTTCGCGA CAAGTTTCGC CGGCATGATC GGTCTCTATC TCATCGGCGT GCTGTGGCTT
CGCAGCATGG TTGACCTGAA GGTGTGA
 
Protein sequence
MTLELLRFFA LLAIFVFVVI VVQQAANATL TRQRNRNAVN KRLKLLASGM DREAVTNLLK 
VNHAGSFRRR GIVVRLRRML ARTGMSITPD RILAGMAIAT AVAIILLSLL AFSLGTTITL
GTILLICVVG LGFGAGIPYV VLNRSAEKRR KRMEQQFAPA VDIFTRALRA GHPVASAISL
LSTEMADPIG TEFGLVADEI AYGANLNDSL VALAERWELE DLHMFAVCLS VQSETGGNLA
EILGNLASVI RDRASLYQKV RALSSEGRAS AWMLSVLPAL TMLVLFMINP SFYLEVSGDP
LFATSFAGMI GLYLIGVLWL RSMVDLKV
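
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. The following is a rough Python sketch using only the standard library. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It relies on translation table 11 assigning the same amino acids to codons as the standard code; table 11 differs in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence listed above.
fragment = "ATGACCCTCG AACTTCTCAG GTTCTTTGCC CTGTTGGCTA TCTTCGTATT CGTCGTGATC"
print(translate(fragment))  # prints: MTLELLRFFALLAIFVFVVI
print(round(gc_content(fragment)))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the listed GC content and protein sequence to be checked in the same way.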