Gene Saro_2420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2420 
Symbol 
ID3916739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2598959 
End bp2600401 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content67% 
IMG OID640445175 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_497690 
Protein GI87200433 
COG category[S] Function unknown 
COG ID[COG3538] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.226342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATCG ACCGGCGACG GATGATGGCG GGTGCGGCGG CACTGGGCGG GATGGCCGCG 
CTTGGCCCGA TGCGCGCCCT CGCCGCGCAG GCTGCCACTG CGACTGCCCG TCCGGAACCT
GCCGACCGCC TGTTCGCCAG CCCCGCCGTG GAACGCGAGA TTGCGCGCGT ATCGGCATTG
ATCGCCGATG CCGACCTCCG GCGGCTCTTC GTCAACTGCT ATCCGAACAC GCTCGACACC
ACGGTCCATC TATCCAGCGT CGAGGGTCGT CCCGACGCTT TCGTCATCAC CGGCGACATC
GACTGCATGT GGTTGCGCGA CAGTTCGGCG CAGCTCAATC CCTACCTGCA CCTCGTGCGC
GAGGACGAGG CGTTGCGCGG TCTGTTCCGT GGGCTGATCG CGCGGCAGGC GCGCTCGATC
CTGATCGACC CCTATGCCAA TGCATTCATG CGCGACCCGT CGGCAAGCAC GAACCTGCCA
TGGGCGCTCG CCGACGATAC CGAGATGAAG CCGGGCGTGG CGGAGCGGAA GTGGGAAGTG
GATTCGCTCT GCTACCCGAT GCGCCTTGCC CATGACTACT GGAAGGCGAG CGGCGACACC
GCGCCGTTCG ACGCGCTCTG GGCCGAGGCG GCCTGGGCCA GCATCCGCAC CTTCCGCGAA
CAGCAGCGCA AGGACGACCC CGGCCCCTAC CGCTTCCTGC GCCGCGACAA GCTGGCGACC
GAGACGCAGA TCCTTGGCGG CTATGGCGCG CCGACGCGCA AGGTGGGCAT GATCCACAGC
ATGTATCGCC CTTCCGACGA TGCCTGCGTG TTTCCCTTCC TGGTGCCGTC GAACCTCTTC
GCCGTCGCCG CCCTGCGCAA GCTGGCGGCG CTGGCGGGTG CGGTGCAGCA GGCCAAGCTT
GCCAGTGCCG CGCTGGACCT GGCGCGCGAG GTGGAGCTGG CCACCTATGC CAACGGCACG
ATCATCGATC CGGCCAGCAA CGAACGGCTC TGGGCCTACG AGGTCGACGG CTTCGGCAAC
GGACACTTCA TGGACGATGC CAACGTGCCC AGCCTGTCGA GCCTTGCCTA TCTCGGCGCG
GTCCCTTCGG ACGATCCGCT GTTCCTGCGC ACCCGCGCCG CCGCGTGGAG CGAGCGCAAT
CCGTACTTCT TCAAGGGCAC CGCCGCGGAA GGCATCGGCG GCCCCCACGC CGGGCTGCGC
ATGATCTGGC CGATGGCAAT CACCATGCGC GCGCTGTCGA GCGACGACGA CGCAACGATC
CGCCAGTGCC TGGCCATGCT CAAGGCCAGC CACGCCGGCA CCTTCTTCAT CCACGAGGCT
TTCGACCAGG ACGATCCGGC GAAGTTCACC CGCCACTGGT TCGCCTGGGC CAACGGCCTG
TTCGGAGAGC TGATGATAGA CCTCGCCAAC CGCAAGCCGG CATTGCTGGG AGAAGCCGCA
TGA
 
Protein sequence
MKIDRRRMMA GAAALGGMAA LGPMRALAAQ AATATARPEP ADRLFASPAV EREIARVSAL 
IADADLRRLF VNCYPNTLDT TVHLSSVEGR PDAFVITGDI DCMWLRDSSA QLNPYLHLVR
EDEALRGLFR GLIARQARSI LIDPYANAFM RDPSASTNLP WALADDTEMK PGVAERKWEV
DSLCYPMRLA HDYWKASGDT APFDALWAEA AWASIRTFRE QQRKDDPGPY RFLRRDKLAT
ETQILGGYGA PTRKVGMIHS MYRPSDDACV FPFLVPSNLF AVAALRKLAA LAGAVQQAKL
ASAALDLARE VELATYANGT IIDPASNERL WAYEVDGFGN GHFMDDANVP SLSSLAYLGA
VPSDDPLFLR TRAAAWSERN PYFFKGTAAE GIGGPHAGLR MIWPMAITMR ALSSDDDATI
RQCLAMLKAS HAGTFFIHEA FDQDDPAKFT RHWFAWANGL FGELMIDLAN RKPALLGEAA