Gene Saro_1522 details


Gene Information

Locus tag: Saro_1522
Symbol:
ID: 3917197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1566401
End bp: 1567804
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 59%
IMG OID: 640444263
Product: type II secretion system protein E
Protein accession: YP_496797
Protein GI: 87199540
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
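
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
print(cds_lengths(1566401, 1567804))  # (1404, 467)
```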


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTGGG AAATCCGCCG CAGGAACGTA AAGCCAAGAT CGCCCATCGC GGTACCGGCC 
CTGCCTTTGC CCGGCGATAC GATGTTGACC CCAGACAGGA CGGACGAGGC TCAGGTCGGT
GCCGCTCCCG CCGCAAATGA CGTTCTGCTT GGCATCAAGG TCGACATCCA TCGAGAGCTT
CTCGACCGCG TCAATCTGGC GGCCATCGAA AAGCTTTCGC GAACCGACCT GGTTCGTGAA
CTCTCTGACA TCATCGGCGG CATCCTGACC GAACGGAATA TCGCGCTCAA TCGTGTCGAG
CGCGAAGATC TCGTTGAAGA CATCGTCGAT GAACTGGTCG GCCTCGGTCC GCTTGAGCCA
CTCATCAAGG ATGACAGCAT ATCGGACATT CTCGTCAACG GTTACGAGAC AGTTTTCGTC
GAACGCGGCG GCAAACTGCA GCGAGTATCG ACGCGGTTCC AGGATGAGCG GCACCTCCTG
CGCATAATCC AGAAGATTGT CAGTGCCGTA GGTCGCCGCG TCGACGAATC CTCGCCATTT
GTCGATGCGA GACTGGCGGA CGGTTCCCGC GTAAATGCGA TCGTTGCGCC GCTTGCTATC
GACGGATCAC TGTTGTCGAT CCGCAAGTTC TCCAAGAAGC CGATCAGCAT GGCCCGAATG
ATCGAGATTG GCAGCTTGTC AGAACCAATG GCGATTCTGC TCAAGGCCGT GGTTGAAGGT
CGTCTCAACA TCATCATCTC TGGCGGCACC GGCTCGGGCA AGACGACGAT GCTCAATGCC
TTGTCTTCGT ACATCGATGG CACCGAACGT ATCGTCACGA TCGAGGACTC GGCCGAACTT
CAACTCCAGC AGGAGCACGT TGCGCGTTTG GAGACGCGCC CCCCCAACAT CGAGGGGCGC
GGTGAGGTCA GCCAACGCGA TCTGGTCAAG AATGCCCTGC GCATGCGGCC TGACCGGATC
ATCCTGGGGG AATGCCGTGC GGGCGAAGCC TTCGATATGC TTCAGGCGAT GAACACGGGG
CATGACGGCT CGATGACGAC GGTACATGCA AACACTCCGC GCGATGCGCT GACGCGTATT
GAACAGATGG TTGGCATGAG CGGCATCGAT ATTGCGCCTC GTTCGGTCCG GGCCCAGATC
GGCTCGGCCG TCAACGTCGT GATCCAGATC GGCCGTCTTT CCGACGGTCG ACGCAAGACT
CTCAGCATTT CCGAATTGAC CGGGATGGAG GGGGAAACGA TCACCATGCA GGAGATTTTC
CGCTTCAACC AGCGTGGGCG CGACGAGCTC GGCAACGTCA TTGGCCATTT CGAAGCGACC
GGCATCCGCC CCCGGTTCGC TGCACGCCTC GAGGCGAGTG GCATCCACCT CGCCGCCGAT
CTATTCAAGC CGACGATGGG GTGA
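
The GC content reported in the gene information can be recomputed from a sequence laid out like the one above, in space-separated blocks of ten bases. The following is a minimal sketch, assuming the sequence is pasted in as a plain string; the function name `gc_content` is hypothetical, and whitespace and any non-ACGT characters are ignored.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Spaces and line breaks (as in the 10-base blocks of the record)
    are skipped, as is any character other than A, C, G or T.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Small illustrative input, not the gene itself: 6 of 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence and rounding the result to a whole percentage gives a figure that can be compared with the GC content field in the record.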
 
Protein sequence
MNWEIRRRNV KPRSPIAVPA LPLPGDTMLT PDRTDEAQVG AAPAANDVLL GIKVDIHREL 
LDRVNLAAIE KLSRTDLVRE LSDIIGGILT ERNIALNRVE REDLVEDIVD ELVGLGPLEP
LIKDDSISDI LVNGYETVFV ERGGKLQRVS TRFQDERHLL RIIQKIVSAV GRRVDESSPF
VDARLADGSR VNAIVAPLAI DGSLLSIRKF SKKPISMARM IEIGSLSEPM AILLKAVVEG
RLNIIISGGT GSGKTTMLNA LSSYIDGTER IVTIEDSAEL QLQQEHVARL ETRPPNIEGR
GEVSQRDLVK NALRMRPDRI ILGECRAGEA FDMLQAMNTG HDGSMTTVHA NTPRDALTRI
EQMVGMSGID IAPRSVRAQI GSAVNVVIQI GRLSDGRRKT LSISELTGME GETITMQEIF
RFNQRGRDEL GNVIGHFEAT GIRPRFAARL EASGIHLAAD LFKPTMG
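
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates that relationship in plain Python. It assumes the gene sequence is given in the coding direction, as it appears here (it starts with ATG and ends with TGA), and it uses the standard codon assignments, which translation table 11 shares for the codons inside a reading frame. The names `CODON_TABLE` and `translate` are hypothetical, and the alternative start codons that table 11 permits are not handled.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, T C A G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (the 10-base block layout) is removed first. The cleaned
    sequence must have a length that is a multiple of 3.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAACTGGG AAATCCGCCG CAGGAACGTA"))  # MNWEIRRRNV
# Last 24 bases, which end in the TGA stop codon.
print(translate("CTATTCAAGC CGACGATGGG GTGA"))  # LFKPTMG
```

Both fragments reproduce the first and last residues of the protein sequence listed above.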