Gene Saro_1140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1140 
Symbol 
ID3916436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1187406 
End bp1189598 
Gene Length2193 bp 
Protein Length730 aa 
Translation table11 
GC content69% 
IMG OID640443875 
Producthypothetical protein 
Protein accessionYP_496419 
Protein GI87199162 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAATC GTTTCACCTT CGCGCAAGGG ATCAGGTTTT CGATCGGGCT CCTGCTGGGG 
GCATCGGCGC CTGCGGCGCT GGCTCAGGCC GATGACGATC CCGATGGCGG CGGCATGGTC
TCGCGCCCGG TCGTCCAGTC CACGGCACCC AGCGCAAGCC GCGAACTCAA CGCGGCCCTT
GCCCGGCTGG CAGCCGATCC GCGCAACGTG CCCGCCCTGT TCGATGCGGC AGAGGCCGCG
CTTCGCCTTG GCGACAGCGA TGCGGCAATC GGCTTCGTCA CCCGCGCCAA TGAAGTGCAG
CCCAACAACC CGAGGGCAGG CGTGGTCATG GGGCGCGCCT ACCTGGTGGG CGAGGACCCG
GTCAGCGCCA TCCGGGCGTT TGACGAGGCC GAGCGTGCAG GAGGCGATAC CTTCGCGATG
GCTTCCGACC GTGGCCTTGC CTACGATCTT GTCGGCGACA ACGCCCGGGC GCAGCGCTGG
TACCAGGTCG CGCTCAGCCG GGGAGCGGAC GACGAAGTCA CCCGCCGTTA TGCGCTGAGC
CTCGCTATTT CCGGAGATCG CCGTGCTGCC GAAGCCCTGC TCGCGCCCCT GATCCAGCGG
CAGGATCGTC CTACCTGGCG CACCCGCACG TTCGTCATGG CCGTGACCGG TGGCCCGGAC
GAAGCGGTGG CCATTGCCTA TGCGACGATG CCGCAGGAAC TTGCCGCAGG CATCGCGCCC
TATCTGCGCT ACATGCCGCG CCTTACCACC GCGCAACAGG CCGCCGCCGC CAACCTTGGC
CGTTTCCCGC GTGCCGCGGA TATCGGCCGC GATGACCCGC GCATCGTCCA GTACGCCGCC
CTCAATCCGC GTGCGCCCCG CACCGCCGAG GCGGGCCTCA TCCCGTCGGG CGCGCCGCTC
GGTCCCGGCG TTGCAGCTTC GACAGACAAG GCGAGCCGCG AGAGGCGTCG GCGGCCGGGC
AGGGACGACA AGAAACTGGC GGCAGCCACT CCGGTCGCCA AGCCGGTCAG CGTGACGGTC
ACGCCCATTT CACCGCAACC GGCTGCGGCA CTTCCGCCGG CCGCTGCTCG TCCGGCGGCA
CCGTCCTCGC CGCAGGTGGC GTCCGCCGCT GCTGCATCGG TTCCGCAGCC CGCGTCGGCG
GCGCCCAGGC CGACGGTCCT CTCCGCTCTC GATCTTCCGC CGGGTGCCCC GAGACCCGCC
GTGCCGCCCA TCCGTGCAAC AACGCCCGCA ACGGTTTCGC GTCCGAGCGC GACGCCGCCG
GTGCCCGCCC GTGAAGTGGC GGCGCGGCCC GTCCCGGCAA CAACCGTCGC CCCGCCTGCC
GCTTCTTCCG CACCGGTCTC CGCGTCGCCC GTTCCGGCCT CCGCCGTTGC CAGGCCAGGA
AGCTTCGATC TGGCTCGGGT AGGCGGTTCG GCCCAGGTTG CGGCAGCAGC CCCGGTTCAG
GCGGCTGCAC CCGTTCCTCC GGCTTCCTCC GCCTCCAACG CTTCGGCATC GCTCGCGCCG
GTCGCGCTGC CGCCATCGCA GCCGGCGCAG GTCGCTGCGG CAACTGTCCC GCCTCCTGCT
GCTCCCGCGC CCGTTGCCGC TGCGGCAGCA CCGCAGCGTC CGGCCGACTT TGCGAGCCTG
TTCAAGAGCT TCACTCCGCC AGCGGAAGAG CGCACCGCCG ATCGTCCGGC CGTGGACATC
ACACGCCTTC CACCCAAGCC AGCGCCGAAG CCGGTTAAGC CCGACACGCG CGGCGAACGG
CCCGATGGCC CGATCGATGT GTCCCGCACG TCCGCGAAGG ATGCCGAAAA ACCCTCGCCG
AAGGACCTCA AGGCGGGCTT GAAAGACACC AAGACCGCCC CGAAGGACAC CAAGACGGGC
GCGAAGGATG CCAAGGCAGC CGCAAAGGAC GCCAAGGCAA AGAAGGCGGA GCCTTCTCAC
CCCAGCCGCA TCTGGGTCCA GGTGCTTACC GGGGCAAACA AGGACGTGAT GGACAACGAG
TGGCGCCGTA TCGTTAAGGA GGCACCCGAG GTCCTGCGCG GTCGCAAGCC TTTCCTGTCG
CCCTGGCGCA ACAATTTCCG CTTGCTCACC GGTCCCTTCG AGAGCGAAGC GGCGGCGCAG
GAATTCATTG GAAAACTTCG TAAAAGTGGC GTTTCCAGCT ATCAGTGGAC GAGCCCGGCG
GGCCAGCCGG TCGATACTCT CGCGCTGAAG TAG
 
Protein sequence
MTNRFTFAQG IRFSIGLLLG ASAPAALAQA DDDPDGGGMV SRPVVQSTAP SASRELNAAL 
ARLAADPRNV PALFDAAEAA LRLGDSDAAI GFVTRANEVQ PNNPRAGVVM GRAYLVGEDP
VSAIRAFDEA ERAGGDTFAM ASDRGLAYDL VGDNARAQRW YQVALSRGAD DEVTRRYALS
LAISGDRRAA EALLAPLIQR QDRPTWRTRT FVMAVTGGPD EAVAIAYATM PQELAAGIAP
YLRYMPRLTT AQQAAAANLG RFPRAADIGR DDPRIVQYAA LNPRAPRTAE AGLIPSGAPL
GPGVAASTDK ASRERRRRPG RDDKKLAAAT PVAKPVSVTV TPISPQPAAA LPPAAARPAA
PSSPQVASAA AASVPQPASA APRPTVLSAL DLPPGAPRPA VPPIRATTPA TVSRPSATPP
VPAREVAARP VPATTVAPPA ASSAPVSASP VPASAVARPG SFDLARVGGS AQVAAAAPVQ
AAAPVPPASS ASNASASLAP VALPPSQPAQ VAAATVPPPA APAPVAAAAA PQRPADFASL
FKSFTPPAEE RTADRPAVDI TRLPPKPAPK PVKPDTRGER PDGPIDVSRT SAKDAEKPSP
KDLKAGLKDT KTAPKDTKTG AKDAKAAAKD AKAKKAEPSH PSRIWVQVLT GANKDVMDNE
WRRIVKEAPE VLRGRKPFLS PWRNNFRLLT GPFESEAAAQ EFIGKLRKSG VSSYQWTSPA
GQPVDTLALK