Gene Saro_3982 details


Gene Information

Locus tag: Saro_3982
Symbol:
ID: 5077512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009426
Strand:
Start bp: 147458
End bp: 149788
Gene Length: 2331 bp
Protein Length: 776 aa
Translation table: 11
GC content: 63%
IMG OID: 640481088
Product: Type IV secretory pathway VirD4 components-like protein
Protein accession: YP_001165750
Protein GI: 146275589
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3505] Type IV secretory pathway, VirD4 components
TIGRFAM ID: [TIGR02759] type IV conjugative transfer system coupling protein TraD
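
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch like the one below. It assumes the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the record does not state either convention explicitly. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the record's Start bp and End bp fields.
START_BP = 147458
END_BP = 149788


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 2331
print(protein_length(gene_length(START_BP, END_BP)))  # 776
```

Under these assumptions the results agree with the listed 2331 bp and 776 aa.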


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTCGTC CTGACACCTG GTCCGATCAG GCCCGCCCCG GCCAGCTCAA GCACCATTCG 
GCGCGCGGGA ACATGCCGCG CAACGCGGGC AATTTCACTC GCGGCTCGCA GCTCATCACG
CACGAATTCC TGATGTGGTT TTCCTCGGCG AAGATGCCGC TGCTGGTGTG GTTCTTCACG
TTCCTGATCG CGCTCTCGAT CGTCCTCGCG CTGCTGCTTC ACGAGCATGA AGTGCAGATG
ATCCTGATGC GGATCTATGC GGAGGGATGG AGCTTCATGG AGTTCAGCCC GCGCAAGATC
CTCAACCTAA CCTTGCCCTC CGGCCGCGTG ATCCCTGCCC CCGTTTCGAT GATCGCCAGC
CACCCTGATG TCGTCATCGC CTGGAACAAG CTGATGCGCG CAATTTGGGG TTCGCTGTTC
ATCTCGCTGT TCGTCGCGGT GCCGCTTTCG GTCTGGTTCA TCGACCTCTC GCGCAAGCGC
GGCAAGGCGA TCCTCGAAGA ACGCCACCAG CGCGGCGCGA TGCTGGTCGA CGCCAAGGAG
CTTGCCGCCG TGATCAACCA GCACAACAGC GCCGCGCTTG CCCAGGAGAT TGCCGAACGG
ATGCCCGGCA AGACCATGGA CGACGTCATG AAAATGAGCT TTGCCGAGCG CAAGGCCGCC
GGCATCCACC ACGTCTACAA CATCGCCGGC GTATCGTTTC CGTGGCGGAG CGAACAGGCT
CACACGATCA TGATCGGGTC GACCGGGACC GGCAAGACCA CGCAGATGCG GGACATGATC
GCGCAGATGC GCGTGCGCCA GGATCGGGCG GTCGTGTTCG ATCTTACCGG GGCCTACGTC
GAGGCGTTCT ATAACCCCGA GACCGACACG ATCCTCAACC CGATGGACGA GCGCTGCCCG
AGCTGGTCGC TGTTCGACGA GGGCAAGAAC TACGCCGACT TCACCGCGAT CGCATCAGCC
ATCTTGCCGA CCGACGGCGG CGGCTCGGAC CCCTTCTGGA TGCTGGGAGC AAGGACATTG
TTTGTGCAGA CCTGCGTCCA GCTCATGAAG CTCGGCCAGG CGACCAACGC CGCACTCGCC
TACCGGCTGA TGATGGCCGA CCTTGAAGAG GTCCACGAAC TGCTTCGCAA TACCATTGCC
GAGCCGCTGA CCGCGCCAGT CGCGGCGCGC ATGGCCGAGT CTGTCCGTGC AGTTCTCAAC
ACCAATGCCC AGGCCTTGTT GTTTATTCCC GAAGGCAAGG AACCCTTCTC GATTTGCGAC
TGGATTCGCC ACCAGGACAA GCCGGGCTCG ATCCTGTTCA TTACCTCTTC GCATAACGAA
CTGGTGCTCA ACCGGGCGCT CTTGTCGCTG TGGATGAACC TTGCGGTGCA TACCCTGATG
CGGCTGCCGC GCACCCGGTC ATTGCGCACC TGGTTCTTCT TCGACGAAGT CCATGCGCTG
CACCGCCTGC CAGCGATCGA AGACGGCTTG CAGACTGCGC GCGGCTTTGG CGGCGCCTTC
GTGCTCGGCA TCCATTCCTT CGCCAAGCTA GCCGAGACCT ATGGCAAGGA AGGCGCGCAG
AACCTTGCCT CGCTGGCCCG CACCAAGCTG ATCCTGGCAG CGGCCGATCG CGACACCGCC
GAGCACTGCT CGGACTACAT TGGTCACCGC GAAGTGCGGA TGATGGATGA GGCCTACAGC
TACGGCTATT CCAACATCCG CGACGCCGCG ACCATTACCC CGCGCTCGGA AGTGCAGCCG
CTGGTGATCC CCGATGACAT CATGCGCCTG CCTTCGCTGC GCGGGTTCCT GGTCTTTCCG
GAAGGGTTCG ATGCGGCGCG GATCAGGCTC ACCTACAAGG ACTACCCCAA GGTCGCAGAG
GGCTACATTC TGCGCGAGAA CGTCGAGCCT ATCGAGTTCA TCTCCATGCC CAAGGGTGAC
GATGAAGTCG CCGAGACCGG CGGCCGGGAC CGCAGCGGCG AACCGGAACT GGAGCCGCGC
GGCGAAGACC TTGGGCGCGA CCCCGCAGTG CCGTTGTCAC CGGCGCTCGA GCCCGACGCC
AATGCGCCGG AAATCATGCC CGATGGACCA AATCCTGATC CCAGTGTCGG CAAGCAGATG
GCATTTCGTC TGGAGCAAGC GCCTCAGAAC GAACGCACCA ATAGCCGCGA GCAGGACCCG
AGTCAGAAGG CTGAAAAGAG TTCCGCGTCC AAGCCTGCCG CGCAGCGCAC CGTCGGGTCC
CGCGAACTCA ATGACCCCGC CATCCCCGAC AAGGAATCTG AGCGCGGCGC GAAGACCGCC
AAGGGGATCG AGGACCAATC GCATTCCCGT GACGACGGAC CGGAGCTCTA G
 
Protein sequence
MRRPDTWSDQ ARPGQLKHHS ARGNMPRNAG NFTRGSQLIT HEFLMWFSSA KMPLLVWFFT 
FLIALSIVLA LLLHEHEVQM ILMRIYAEGW SFMEFSPRKI LNLTLPSGRV IPAPVSMIAS
HPDVVIAWNK LMRAIWGSLF ISLFVAVPLS VWFIDLSRKR GKAILEERHQ RGAMLVDAKE
LAAVINQHNS AALAQEIAER MPGKTMDDVM KMSFAERKAA GIHHVYNIAG VSFPWRSEQA
HTIMIGSTGT GKTTQMRDMI AQMRVRQDRA VVFDLTGAYV EAFYNPETDT ILNPMDERCP
SWSLFDEGKN YADFTAIASA ILPTDGGGSD PFWMLGARTL FVQTCVQLMK LGQATNAALA
YRLMMADLEE VHELLRNTIA EPLTAPVAAR MAESVRAVLN TNAQALLFIP EGKEPFSICD
WIRHQDKPGS ILFITSSHNE LVLNRALLSL WMNLAVHTLM RLPRTRSLRT WFFFDEVHAL
HRLPAIEDGL QTARGFGGAF VLGIHSFAKL AETYGKEGAQ NLASLARTKL ILAAADRDTA
EHCSDYIGHR EVRMMDEAYS YGYSNIRDAA TITPRSEVQP LVIPDDIMRL PSLRGFLVFP
EGFDAARIRL TYKDYPKVAE GYILRENVEP IEFISMPKGD DEVAETGGRD RSGEPELEPR
GEDLGRDPAV PLSPALEPDA NAPEIMPDGP NPDPSVGKQM AFRLEQAPQN ERTNSREQDP
SQKAEKSSAS KPAAQRTVGS RELNDPAIPD KESERGAKTA KGIEDQSHSR DDGPEL
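
Both sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that blocking, translate the gene sequence, and compare the result with the listed protein. It is an illustration, not the database's own pipeline. It assumes that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ in which codons may act as starts), which is sufficient here because the gene begins with ATG. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments assumed shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and the first two protein blocks.
FIRST_GENE_LINE = (
    "ATGCGTCGTC CTGACACCTG GTCCGATCAG GCCCGCCCCG GCCAGCTCAA GCACCATTCG"
)
FIRST_PROTEIN_BLOCKS = "MRRPDTWSDQ ARPGQLKHHS"

print(translate(FIRST_GENE_LINE) == clean(FIRST_PROTEIN_BLOCKS))  # True
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the listed protein sequence and GC content to be checked against the record.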