Gene Saro_1075 details

Gene Information

Locus tag: Saro_1075
Symbol:
ID: 3916371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1118771
End bp: 1119778
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 64%
IMG OID: 640443810
Product: type II secretion system protein
Protein accession: YP_496354
Protein GI: 87199097
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2064] Flp pilus assembly protein TadC
TIGRFAM ID:
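
The coordinates and lengths listed above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1118771, 1119778)
print(length, protein_length(length))  # 1008 335
```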


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.137371
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAGGA CTCCCCCCGG ACCGACGCTG CTCGGCTTCG ACGTCTACTT CGTCGGCTCG 
ATTCTCGTGG CCATCGCGGC TTTCGCGGTC ATGCTGGCAA TCTACGCCGC AGTCACCGTC
CGCGATCCGA TGGCCAAACG TGTCAAGGCG CTGAACGAGC GGCGCGAACA GCTCAAGTCA
GGCATCGTCA CCGCCAACGC CCGCAAGCGC ACGAGCATCG TCCGCCGCAA CCAGACGACC
GACCAGATCC GCGGTTTCCT CGAATCGCTG AAAGTCCTGC AGGACAGTCA GCTCGCGGTC
ATCCAGCAAA AGCTGGCGCA GGCCGGCATC CGCAAGAAGG AATGGGCGGT CGCCGTCATC
CTCGGCCGAC TCGTCGGACC GATCGCGCTC GGCCTGTTCG GCGCGGCCGT GTTCTATTGG
TCGAACACCT TCCCCGACTG GAGCCCGTTC AAGCGCTTCC TCGGCTTCGC GGTCTGCCTC
ATCGCGGGCT ACAAGGGACC GGACCTCTTC ATCCAGAACC TCGTGTCCAA GCGCACGGTT
GCTGTCCGCA AGGGCCTTCC CGATGCGCTC GACCTGCTGG TGATCTGCGC CGAGGCCGGT
CTTACGGTCG ACGCCGCCTT CAGCCGCGTC GCCCGCGAAC TTGGCCGCGC CTATCCCGAA
CTGGGCGACG AGTTTGCCCT GACTGCCATC GAACTGTCGT TCCTGACCGA GCGCAGGCAC
GCCTTCGAAA ACCTTGCCTA CCGCGTCGAC CTGGACTCGG TGAAGGGCGT GGTCACGACG
ATGATCCAGA CCGAACGCTA CGGCACGCCG CTGGCATCGG CCCTGCGCGT GCTGTCGGCG
GAGTTCCGCA ACGAGCGAAT GATGCGCGCC GAGGAAAAGG CCGCGCGCCT TCCCGCAATC
ATGACGATCC CGCTCATCCT TTTCATCCTG CCGGTGCTGT TCATCGTCAT TCTCGGCCCA
GCCGCATGCT CGATCAGCGA CAGCCTCGTC AACAAGAAGC CGGTCTGA
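
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any analysis. The sketch below shows one way to strip the whitespace and recompute a GC percentage for comparison with the GC content field in this record. The helper names are hypothetical, and the example runs on the first line of the sequence only.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGACAAGGA CTCCCCCCGG ACCGACGCTG CTCGGCTTCG ACGTCTACTT CGTCGGCTCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full sequence into `clean_sequence` should give a length equal to the Gene Length field.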
 
Protein sequence
MTRTPPGPTL LGFDVYFVGS ILVAIAAFAV MLAIYAAVTV RDPMAKRVKA LNERREQLKS 
GIVTANARKR TSIVRRNQTT DQIRGFLESL KVLQDSQLAV IQQKLAQAGI RKKEWAVAVI
LGRLVGPIAL GLFGAAVFYW SNTFPDWSPF KRFLGFAVCL IAGYKGPDLF IQNLVSKRTV
AVRKGLPDAL DLLVICAEAG LTVDAAFSRV ARELGRAYPE LGDEFALTAI ELSFLTERRH
AFENLAYRVD LDSVKGVVTT MIQTERYGTP LASALRVLSA EFRNERMMRA EEKAARLPAI
MTIPLILFIL PVLFIVILGP AACSISDSLV NKKPV
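
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. Since this gene starts with ATG, no special handling of the first codon is needed here. The `translate` helper is illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop.

    Whitespace is ignored; ambiguous bases such as N raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as printed in this record.
print(translate("ATGACAAGGA CTCCCCCCGG ACCGACGCTG"))  # MTRTPPGPTL
```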